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PEPTIDES AND PEPTIDOMIMETICS INHIBITING THE ONCOGENIC ACTION 

OF P21 RAS 

Field of the Invention 

This invention relates to peptides effective in inhibiting 
oncogenesis, particularly as related to inhibition of p2l ras and 
adenocarcinomas of the colon, pancreatic carcinomas, 
neuroblastomas, and other cancers which express the transformed 
sequence of the ras gene product . 

Background of the Invention 

ras protooncogenes are activated by characteristic point 
mutations in a wide variety of malignancies. The expressed p21 
ras proteins are oncogenic by virtue of single substituted amino 
acids, usually at position 12 or 61 of the 189-residue p21 ras 
gene product, ras proteins act as membrane -associated molecular 
switches that bind GTP and GDP and slowly hydrolyze GTP to GDP. 

Mutations in ras are associated with the vast majority of 
adenocarcinomas of the colon. Cancer of the colon is a highly 
treatable and often curable disease when it remains localized to 
the bowel. It is the second most frequently diagnosed malignancy 
in the United States as well as the second most common cause of 
cancer death. Surgery is the primary treatment and results in 
cure in approximately 50% of patients. Adenocarcinoma is the 
primary lesion in the majority of cases. Recurrence following 
surgery is a major problem and often is the ultimate cause of 
death. The prognosis for colon cancer patients is clearly 
related to the degree of penetration of the tumor through the 
bowel wall and the presence or absence of nodal involvement. For 
locally advanced disease, the role of radiation therapy in colon 
cancer is under clinical evaluation. There is no standard 
therapy for advanced colon cancer and no evidence that 
chemotherapy improves survival, although short-term palliation 
may be achieved in approximately 10-20% of patients. 
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Pancreatic carcinoma has a high incidence of K-ras 
mutations. Mutated K- ras sequences which can be identified by 
polymerase chain reaction utilizing allele-specif ic primers can 
even be found in the plasma or serum from patients with 
pancreatic carcinoma. The c-Ki- ras oncogene is activated by 
point mutations involving codon 12 in 72%-100% of primary 
pancreatic adenocarcinomas, but the gene is not activated in 
nonneoplastic tissues. Cancer of the exocrine pancreas is rarely 
curable. The highest cure rate (4%-12%) occurs if the tumor is 
truly localized to the pancreas. Unfortunately, this stage of 
disease accounts for fewer than 20% of cases and, even with 
surgical resection, results in little more than a 5% 5-year 
survival rate. For small cancers (less than 2 cm) in the head 
of the pancreas with no lymph node metastases and no extension 
15 beyond the "capsule" of the pancreas, the survival rate following 
resection of the head of the pancreas approaches 20%. Overall 
survival rate of all stages is less than 2% at 5 years with most 
patients dying within one year. Worldwide, very few patients 
with cancers of the pancreatic tail or uncinate process have been 
2 0 cured. 

Lung cancers also frequently involve ras mutations. Point 
mutations in codon 12 of the K- ras protooncogene occur more 
frequently in lung adenocarcinomas from smokers (30%) than they 
do in lung adenocarcinomas from nonsmokers (7%) , suggesting that 

25 smoking is an important factor in the induction of these 
mutations. The ras oncogene may thus be a specific target of the 
mutagenic activity of tobacco smoke, and suggest that DNA 
alterations at this site can occur early and irreversibly during 
the development of adenocarcinomas of the lung. 

30 Mutations in the ras protooncogenes are the most frequently 

observed molecular alteration in acute myeloid leukemia (AML) 
Whether ras mutations occur as late or relatively early events 
in the multistep process of myeloid transformation, remains an 
open question. There is significant evidence that the ras 

35 oncogene plays a role in experimental mammary carcinogenesis; the 
evidence in human breast cancer, however, is more limited. 
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Similarly, there is significant evidence that the ras 
oncogene plays a role in nitrosoamine- induced esophageal tumors 
in rats, but in human esophageal cancers ras gene mutations are 
more rarely found. However, it is probable that there is a 
5 significant role of mutated ras genes in both cell proliferation 
and malignant transformation of human esophageal cells. 

Certain human neuroblastomas also show a high incidence of 
oncogenic ras mutations. Indeed, one study suggested that 
expressions of the oncogene N-myc and p21 together as detected 

10 by immunohistochemical staining could be among the most reliable 
prognostic indicators in neuroblastoma patients. 

The ras proteins are key regulators of the growth of 
eukaryotic cells. Some of the direct targets are unknown. These 
target proteins include raf -1 , gap , phosphatidylinositol-3 - 

15 hydroxykinase and, very recently, two nuclear proteins, C-JUN and 
its kinase (JNK) . The three-dimensional x-ray crystal structure 
for a ras -related protein bound to a domain of raf -1 has been 
elucidated. The ras -related protein ( rak-l-a ) binds to raf 
directly, utilizing residues contained in a sequence involving 

20 amino acids 35-37. All of the contact residues in the ras - 
related protein are homologous to those in the corresponding 
segment of ras-p-21. One of the inventors has shown that the p- 
21 ras protein (35-47 segment) selectively inhibits the mitogenic 
effects of oncogenic ras-p-21. 

25 In addition to its role as an oncogene, the activation of 

ras proteins is a key step in the signal transduction pathways 
triggered by ligand-bound cell surface receptors, such as the 
insulin receptor. 

The classical target of the ras protein is the GTPase 

30 activating protein GAP. This target protein is thought to play 
an essential role in the regulation of ras activity by increasing 
the GTPase activity of wild type, but not transformed ras . On 
the other hand, there is a considerable superfamily of these GAP- 
related proteins, which includes pl20-GAP. Other target proteins 

35 besides mammalian gap itself include (1) IRA1 and IRA2 , the 
functional equivalents of GAP in yeast. They regulate the 
ras-cyclic AMP pathway, controlling cell growth; (2) sari, the 
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fission yeast protein that regulates rasl in that organism; (3) 
BUD2, a yeast protein that activates BUD1/RSR1 which participates 
in the regulation of bud-site selection; (4) Human 
neurof ibromitosis (gene NF1) . NF1 is associated with type l 
5 neurofibromatosis, one of the most frequently inherited genetic 
diseases characterized, in part, by multiple neural tumors. NF1 
has been shown genetically and biochemically to interact with 
and stimulate the GTPase activity of ras; (5) Drosophila Gapl, 
which acts as a negative regulator of signalling by the Sevenless 

10 (SOS) receptor tyrosine kinase involved in eye development. 

Human SOS1 and SOS2 genes have also been recently identified 
which encode proteins that control GDP-->GTP exchange on ras 
proteins and are involved in signal transduction by tyrosine 
kinase receptors. In situ hybridization shows that SOS1 maps to 

15 2p22-->pl6 and SOS2 to 14q21-->q22 in the human genome. 

Another important target of ras is raf . The protein encoded 
by the c-raf-1 protooncogene is thought to function downstream 
of p21 ras because disruption of raf blocks signalling by ras in 
a number of systems. A highly-conserved 81 residue region of the 

20 N-terminus of raf protein has been to be shown to be critical as 
the ras protein interaction region. Importantly, the raf gene 
product interacts with both wild- type and activated ras protein. 
In one study, approximately 50% of the clones identified as 
interacting with ras were encoded portions of the c-raf and A -raf 

25 serine/threonine kinases. Thus, ras and the N- terminal region 
of raf protein associate directly in vitro and this interaction 
is dependent on GTP bound to ras . 

Within the superfamily of ras -related GTP-binding proteins, 
only the ras protein itself has been shown to act as an oncogenic 

3 0 protein. Many other proteins, however, have substantial amino 
acid homology to ras . This ras superfamily of GTP-binding 
proteins (> 50 members) regulates a diverse spectrum of 
intracellular processes. These include cellular proliferation 
and differentiation, intracellular vesicular trafficking, 

35 cytoskeletal control, NADPH oxidase function, as well as others. 

Some of these homologs may have biological activities which are 
related to ras . For example, rhoA encodes a ras -related 
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GTP-binding protein that was thought principally to play a role 
in cytoskeletal organization. Recent evidence, however, has 
suggested both that rhoA could act either as a dominant oncogene, 
since transfection of both normal and activated rho genes confer 
5 a transformed phenotype on fibroblast cells in culture, or as a 
recessive tumor suppressor gene, by virtue, in part, of its 
chromosomal location at 3p21, a site deleted in many human 
malignancies. Thus, it is important to consider these ras 
homologs as potentially involved in cell growth and 
10 transformation . 

Azatyrosine strongly inhibits oncogenic ras-p-21. This 
small molecule induces the rrcr gene, which encodes a proteinase 
se q Uence showing 90% amino acid sequence identity to lysyl 
oxidase . 

15 To acquire transforming potential, the precursor of the ras 

oncoprotein must undergo f arnesylation or similar modification 
of the cysteine residue located in a carboxyl- terminal 
tetrapeptide . These C-terminal lipid modifications are essential 
for the interaction of ras -related proteins with membranes. 

20 While all ras proteins are farnesylated and some palmitoylated, 
the majority of other ras -related proteins are 
geranylgeranylated . Thus selective peptide and peptidomimet ic 
inhibitors of ras lipidation have found potential utility as 
ant i -oncogenic agents . 

25 In view of the foregoing, there is there a longfelt need in 

the art for agents which inhibit the transforming ability of ras . 
As described above, selective peptide and peptidomimet ic 
inhibitors or ras lipidation have found potential utility as 
anti-oncogenic agents (Kohl et al . (1993) Science 260. : 1934 - 193 7 ; 

30 James et al . (1993) Science 260 : 1937-1942) . Similarly, FR 
patents 2694296 and 2690162 teach that peptides derived from the 
GAP protein may serve to inhibit ras . However, neither '694296 
nor '690162 describes peptides derived from the ras protein 
itself. EP 203587 describes new ras oncogene polypeptides which 

35 are used for producing antibodies for immunogenic assays. 

However, these sequences are derived from ras and its homologs 
in the carboxyl terminal domain (residues 170 - 18 9 in SEQ ID 
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NO: 5) and are thus physically distant from and completely 
unrelated to any sequences claimed herein. Furthermore, these 
sequences were claimed for the production of antibodies, 
preferably by linking to an immunogenic carrier, and a claim for 
direct therapeutic application was not made. 

Thus, peptides constructed from ra£ and its homologs for 
therapeutic application, namely by interfering with downstream 
or upstream actions of ras itself, are useful. Furthermore, the 
method of identification of said peptides utilizing calculat ional 
approaches is believed novel and has unexpectedly led us to these 
cyclic peptides and peptidomimet ics disclosed herein. 

Summary of the Invention 

The present invention provides peptides, cyclized peptides 
and peptidomimetics capable of inhibiting the oncogenic action 
of p21 xas- The oncogenic ra& - inhibiting cyclized peptides 
correspond to domains of the oncogenic ras protein which are most 
flexible and important in "interacting with target proteins 
upstream and downstream from xa£. The peptidomimetics are 
obtained by molecular modeling, including the structural 
minimization techniques of molecular dynamics. 

The peptides are designated by the formulas: Val -Val -lie, 
Lys-Arg-Val, lie -Lys-Arg-Val -Lys -Asp (SEQ ID NO: 1) , Lys-Cys-Asp- 
Leu-Ala (SEQ ID NO: 2) , Cys-Asp-Leu-Ala-Ala-Arg-Thr (SEQ ID NO : 3 ) , 
Asp-Leu-Ala-Ala (SEQ ID NO:4) or physiologically acceptable salts 
of the foregoing peptides . 

Also provided in the present invention are cyclic analogues 
of the above peptides and certain peptides and cyclic peptides: 

cyclo [- R(l) R(2) Thr lie Glu Asp Ser Tyr Arg Lys Gin Val 
Val lie Asp R(3) R(4)-] (I); 

cyclo [-R(l) R(2) Val Val He R(3) R(4)-] (II); 

cyclo [-R(l) R{2) Tyr Arg Glu Gin He Lys Arg Val Lys Asp 
Ser Asp Asp Val Pro R(3) R(4)-] (III); 
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cyclo [-R(l) R(2) Lys Arg Val R{3) R(4)-] (IV); 

cyclo [-R(l) R(2) He Lys Arg Val Lys Asp R(3) R(4)-] (V) ; 

cyclo [-R(l) R(2) Gly Asn Lys Cys Asp Leu Ala Ala Arg Thr 

Val Glu R(3) R(4) -] (VI) ; 

cyclo [-R(l) R(2) Lys Cys Asp Leu Ala R(3) R(4)-] (VII); 

cyclo [-R(l) R(2) Cys Asp Leu Ala Ala Arg Thr R(3) R(4)] 
(VIII) ; 

cyclo [-R(l) R(2) Asp Leu Ala Ala R{3) R(4)-] (IX); and 



NH 2 



O Asp 
II I 

H 2 N-Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys~Gln-NH-CH 2 -C-Val-NH-CH (X) 
15 CH 2 CH- 2 

or physiologically acceptable salts thereof. 

In cyclized peptide formulas (I) -(IX), R(l) R{2), R(3) and 
R(4) represent, in the most general case, any amino acid which 
can serve as an amino acid residue linker. Amino acid residue 

20 linkers are usually at least one residue and can be most often 

two to four residues, more often 1 to 10 residues, both ranges 
being inclusive. Typical amino acid residues useful for linking 
are tyrosine, cysteine, lysine, and glutamic and aspartic acid. 
Most preferably [R(D, R(2)] and [R{3) , R(4)] are each 

25 independently selected from either the group consisting of Glu, 
Gin, Asp, Asn or from the group consisting of Lys, Arg, Orn. 

The symbol - represents a bond between the carboxyl and 
amino termini by which R(l) and R(4) can be interconnected to 
each other via an lower alkenyl or lower alkynyl group, but most 

3 0 preferably by a branched or unbranched methylene bridge of type 
--(CH 2 ) m --or -- (CH 2 ) m --M-- (CH 2 ) m ,-- . In such an moiety, m and m' 
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are integers from 1 to 6, inclusive, and preferably from 1 to 3 , 
inclusive; and M is NH, N[R(5)], 0, S or CH-R(5), wherein R(5) 
is lower alkyl, cycloalkyl or aryl and is preferably methyl, 
ethyl, propyl, phenyl, X-phenyl, or heterocyclic, wherein X is 
C1-, CF 3 -, F-, substituted at the o-, m- , or p- positions on 
the phenyl group M can contain a part of another diamino acid 
within the same peptide, e.g., the omega amino group of the one 
residue can be so linked to such an unnatural amino acid residue 
in a terminal residue. 

Furthermore, any amino acid in the sequences provided 
hereinabove may be replaced with its D-analogue, with the proviso 
that not more than 50% of the total amino acids are so replaced. 
Similarly, a homologous conservative substitution for any amino 
acid is within the bounds of the present invention provided that 
substitution does not eliminate the oncogenic ras p21- inhibiting 
activity. Thus, depending on the applications for which the 
peptides according to the invention are intended, it is also 
possible to envisage intercalating between several amino acids, 
or even between all the amino acids, of the peptides defined 
above, dextrorotatory amino acids, and in particular 
dextrorotatory phenylalanine or dextrorotatory tryptophan, 
capable of preventing the action of the degradative enzymes in 
the cell environment and thus of increasing their activity. 
Another modification in this sense consists in replacing certain 
amino acids, for example of the isoleucine type, by leucine. 

In addition, a subject polypeptide can differ, unless 
otherwise specified, from the natural sequences shown above by 
the sequence being modified by terminal -NH 2 acylation, e.g., 
acetylation, or by terminal -carboxylamidat ion, e.g., with 
ammonia, alkylamines, and the like. 

This invention further relates to pept idomimet ics which 
model the critical semi-extended conformation of at least one 
peptide of or cyclic peptide of the present invention, 
exemplified by the compounds of Structure 1: 
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wherein the sidechain R attached at the carbon atom numbered 6 
on the sterol nucleus can be NH-CH 2 -CH 2 NH 3 * , alkylamino, 
arylamino, or aralkylamino group, and wherein the sidechain 
5 attached at the carbon number 3 can be replaced with -0-C(=0)- 
- (CH 2 ) m - -COOH, where m is an integer from 1 to 6, inclusive, 
preferably from 1 to 3 , inclusive, and more preferably 2, and one 
of X and Y at each position independently, can be one H, a small 
alkyl group of C x to C 3 , preferably C L ; a halogen, preferably F, 
10 or an amino group where the other of one of X and Y is H. 
Preferably, each of X and Y is H. 
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An exemplary compound falling within Structure 1 is 3 
malonoxy- 6 - ( 2 - aminoethyl ) aminocyclopentanoperhydrophenanthrene 
(Structure 2) . 

5 Detailed Description of the Invention 

The natural sequence of the human oncogenic ras p21 is given 
in SEQ ID NO : 5 . The crystal X-ray structure has been determined 
at high resolution for that portion of the human ras protein 
corresponding to residues 1 to 16 6 of SEQ ID NO : 5 . 

0 The regions of the p21 protein that are the most likely to 

change their conformations upon activation of the protein, e.g. 
by oncogenic amino acid substitutions have been computed using 
two different methods. Both methods are based on the principle 
that the linear sequence of amino acids in a protein determines 

5 its unique three-dimensional structure. Given an amino acid 
sequence of a polypeptide or protein, therefore, it should be 
possible to predict its three-dimensional structure. This task 
can be accomplished by using the principle that the observed 
three-dimensional structure of a protein is the one of lowest 

0 free energy. There are a vast number of possible structures a 
given polypeptide chain can adopt, but essentially only one of 
these is observed. To allow folding to occur, therefore, the 
interatomic interactions in the protein chain must greatly 
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stabilize its final folded form, i.e., lower its conformational 
energy substantially with respect to that of any other competing 
structure. Thus, to compute the lowest energy form of a protein, 
it is necessary to be able first to compute the conformational 
energy of a given conformation of the protein and then, second, 
to generate its low energy conformations, or a representative 
sampling of them. The structure of lowest conformational energy 
so computed is then predicted to be the observed structure of the 
protein. This structure may be the one determined by x-ray 
crystallography or by 2- or 3 -dimensional nuclear magnetic 
resonance (NMR) techniques. 

A set of potential energy functions, in the computer program 
ECEPP (Empirical Conformational Energies of Peptides Program) , 
have been developed that accurately compute the conformational 
energies of given conformations of proteins. The conformational 
energy of a peptide can be expressed in Equation 1. 



E tot " DR ij "13 ~ k R . . 



1*3 1*3 



ij k 



where E tot is the total conformational energy of the protein, the 
Q's are the charges on the i th and j th atoms; R xj is the distance 
between the i th and j th atoms, D is the dielectric constant, e ±j 
and p i:j are the lowest non-bonded (Lennard- Jones ) energy and the 
distance at this lowest energy between atoms i and j in the 
protein; A k is the torsional barrier to rotation around specific 
bonds; 6 k is the k ch dihedral angle in the protein; n is a 
degeneracy factor, i.e., 3 for single bonds and 2 for double 
bonds; and the sign in the last summation term is positive for 
single bonds and negative for double bonds such as occur in the 
peptide bond units. 

This equation shows the total conformational energy as the 
sum of three terms: the pairwise electrostatic interactions 
between the individual atoms of a protein, each of which has a 
partial charge, (first sum) ; a non-bonded energy term (second 
term) that consists of an attractive term that varies as the 
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inverse sixth power (tenth power for hydrogen -bonding atoms) of 
the distance between the atoms (from an induced dipole- induced 
dipole interaction term) and a repulsive term, from the overlap 
of electron shells, that varies as the inverse twelfth power of 
the interatomic distance; and finally a torsional term (third 
sum) that depends upon the bonds about which rotation takes 
place. All of the constants in these terms have been determined 
from experimental crystal packing data and reproduce the lattice 
constants of all of the crystal structures of small molecules to 
which they have been applied and, where measured, the sublimation 
energies of these crystals These potential functions have been 
used to compute the low energy minima for single terminally 
blocked amino acid residues, simple peptides, oligopeptides, 
polypeptides, and proteins with excellent agreement between the 
lowest energy predicted structures and the structures determined 
experimentally. These potentials have therefore been well- 
tested, are based on experimental data, and have proved to be 
reliable in prediction of structure from sequence. 

These potential functions have been used to compute the 
average structure for the ras-p-21 protein in its normal and in 
its oncogenic form using the perturbation method called the 
electrostatically-driven Monte Carlo method (EDMC) . Specific 
regions of the oncogenic p21 protein undergo large conformational 
changes compared with the structure of the normal, inactive 
protein. One of these regions has been found to be residues 35- 
47. All of the segments that change conformation in the 
oncogenic protein were found to be the most flexible in the 
normal, inactive protein. 

Of considerable significance has been the finding that a 
completely different method, viz. molecular dynamics, based upon 
a completely different set of potential functions, i.e. the 
program DISCOVER, yields identical results for the p21 protein. 

Molecular dynamics is based on the principle that the 
positions of the atoms of a molecule can be predicted as a 
function of time by solving Newton's equations of motion for the 
molecule. The force on the molecule is the negative of the first 
derivative of the potential function with respect to the 
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coordinates of each of the atoms. Newton's equations of motion 
are then integrated, using the Verlet algorithm, over a 
trajectory such that the low energy regions around the starting 
structure are computed. The trajectories are computed over time 
5 intervals such that the total energy converges to a low, constant 
value. The structures whose energies have converged are then 
used to compute an average structure. Comparison of the 
coordinates of the atoms of this average structure with those of 
the starting structure reveals regions of the protein whose 

10 conformations may change significantly. Furthermore, if the 
variance of the coordinates of regions of the low energy 
structures from the corresponding coordinates of the average 
structure are high, these regions can be identified as being 
flexible, i.e., are the ones most likely to be parts of effector 

15 domains. Within this algorithm, for the p21 protein, up to 2000 
water molecules have been generated around the protein in the 
molecular dynamics simulations performed thus far. 

Using these novel calclilational approaches, the present 
inventors have identified important peptide regions of the 

20 protein that are involved in the signal transduction process, and 
these peptides can be used to design anti -cancer agents, as 
taught herein. We have found that most particularly the 35-47, 
96-110 and 115-126 peptides have strong and specific anti- 
oncogenic p21 activity. Even more particularly, we found that 

2 5 these domains contain unique extended structures and/or short 
beta-bend structures which are hypothesized to account in large 
part for their biological uniqueness. This suggested that 
cyclization of the peptide structures to force the beta-bend 
conformation in place would serve to enhance therapeutic 

30 activity. 

The results of these studies indicate that a domain of 
particular interest is the domain from residues 35 through 47 of 
SEQ ID NO:5, i.e., Thr- Ile-Glu-Asp-Ser-Tyr-Arg-Lys -Gin- Val -Val - 
Ile-Asp (SEQ ID NO : 6 ) , of even more particular interest the 

35 peptide corresponding to residues 44 to 46 in SEQ ID NO : 5 , i.e., 
Val-Val-Ile, of still more interest is the sequence from residues 
96 to 110 of SEQ ID NO : 5 , i.e., Tyr-Arg-Glu -Gin- Ile-Lys-Arg- Val - 
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Lys-Asp-Ser-Asp-Asp-Val-Pro (SEQ ID NO:7), of even more 
particular interest is the sequence from residues 101-103 in SEQ 
ID NO:5, i.e., Lys-Arg-Val ; and the sequence corresponding to 
residues 100 to 105 in SEQ ID NO : 5 , i.e., Ile-Lys- Arg-Val -Lys-Asp 
5 (SEQ ID NO:l) ; the sequence corresponding to residues 115 to 126 

of SEQ ID NO:5, i.e., Gly-Asn-Lys-Cys -Asp-Leu-Ala-Ala- Arg-Thr- 
Val-Glu (SEQ ID NO: 8); and most particularly the sequence 
corresponding to residues 117 to 121 of SEQ ID NO: 5, i.e, Lys- 
Cys-Asp-Leu-Ala (SEQ ID NO : 2 ) and the sequence corresponding to 
10 residues 118 to 124 of SEQ ID NO: 5, i.e., Cys-Asp-Leu-Ala-Ala- 

Arg-Thr (SEQ ID NO: 9) ; and the sequence corresponding to residues 
119 to 122 of SEQ ID NO : 5 , i.e., Asp- Leu- Ala-Ala (SEQ ID NO : 4 ) . 

Additional sequences homologous to the various preferred 
sequences recited hereinabove can be derived by one skilled in 

15 the art from the sequences of closely related ras proteins. Such 
sequences may possess enhanced therapeutic activity. Nonlimiting 
examples of such proteins closely related to the ras gene product 
which represent the parent sequences having identical or nearly 
identical three dimensional structures and from which homologs 

20 of the sequences given in the preceding paragraph can be derived 
by one normally skilled in the art are: 

ras-related protein Ara-3 [ Arabidopsis thaliana (mouse ear 
cress)] (SEQ ID NO:10); 



ras-related protein Ara-2 [A^ thaliana 1 SEQ ID NO: 11; 

25 ras-related protein Ara-1 [A^ thaliana l SEQ ID NO: 12; 

ras-related protein OraB-1 [ Discopyge ommata (electric 
ray) ] SEQ ID NO: 13 ; 



ras-related protein Rab-IA [ Lvmnea stagnalis (great pond 
snail)] SEQ ID NO: 14; 
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ras -related protein Rab-2 [" Homo sapiens (human) ] SEQ ID 
NO: 15; 

ras -related protein Rab-2 [ L. stagnalis ] SEQ ID NO: 16; 

ras -related protein Rab-2 f Orvctolagus cuniculus (rabbit)] 
5 SEQ ID NO: 17; 

ras -related protein Rab-2 [Rattus norvegicus (rat)] SEQ ID 
NO: 18; 

ras -related protein Rab-3 r Drosophila melanogaster 
(fruit fly)] SEQ ID NO: 19; 

10 ras -related protein Rab-4 [R^ norvegicus l SEQ ID NO: 20; 

ras -related protein Rab-6 [ Caenorhabditis elegans ] SEQ ID 
NO: 21; 

ras -related protein Rab-6 [ H. sapiens l SEQ ID NO: 22; 

ras -related protein Rab-7 [Canis f amiliaris (dog) ] SEQ ID 
15 NO: 23; 

ras -related protein Rab-7 [ Dictvostelium discoideum (slime 
mold)] SEQ ID NO:24; 

ras -related protein Rab-8 [ C . f amiliaris ] SEQ ID NO:25; 

ras -related protein RabC [D^ discoideum ] SEQ ID NO:26; 

20 ras -related protein Rac-1 [C_^ elegans ] SEQ ID NO:27; 

ras -related protein Rac-IA [D^ discoideum ] SEQ ID NO:28; 

ras -related protein RacB [D^ discoideum ] SEQ ID NO: 29; 
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ras-related protein RacC [D^ discoideum l SEQ ID NO: 30; 
ras-related protein Ral-A [H^ sapiens] SEQ ID NO:31; 
ras-related protein Ral-B [H^ sapiens ! SEQ ID NO: 32; 
ras -related protein O-Ral [D^ ommata] SEQ ID NO: 33; 
ras-related protein Ora-1 [D^ ommata] SEQ ID NO: 34; 
ras-related protein Ora-2 [IX ommata 1 SEQ ID NO: 35; 
ras-related protein Ora-3 [D^ ommata l SEQ ID NO: 36; 
ras-related protein Rap-1 [Eu discoideum l SEQ ID NO: 37; 
ras-related protein Rap-2A [H^ sapiens] SEQ ID NO: 38; 
ras -related protein Rap-2B [H^ sapiens ] SEQ ID NO: 39; 
ras-related protein O-KREV [D^ ommata] SEQ ID NO: 40; 
ras-related protein Rap-IA [H^ sapiens ] SEQ ID NO: 41; 
ras-related protein Rap-IB [H^ sapiens ! SEQ ID NO:42; 
ras-like protein GNROR3 [D^_ melanoaaster l SEQ ID NO:43; 
ras-like protein rasA [EK_ discoideum l SEQ ID NO: 44; 
ras-like protein ras B [ D . discoideum l SEQ ID NO: 45; 
ras-like protein rasC [D^ discoideum l SEQ ID NO: 46; 
ras-like protein rasG [D^ discoideum l SEQ ID NO: 47; 
ras -like protein F54C8.5 [C\_ eleaans l SEQ ID NO: 48; 
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ras -like protein CC- ras f Coprinus cinereus (inky cap 

fungus) ] 

SEQ ID NO: 49; 

ras -like protein [Geodia cydonium (sponge)] SEQ ID NO:50; 

5 ras -related protein Rab-10 [ C. f amiliaris l SEQ ID NO: 51; 

ras -related protein Rab-11 [ H. sapiens ] SEQ ID NO: 52. 

In addition, as described hereinabove, the therapeutic 
activity of these sequences is enhanced by cyclization. 
Cyclization forces and maintains the conformations of these 
10 peptides in unique structures like beta-bends. The following are 
representative, nonlimiting examples of cyclized peptides useful 
for inhibiting the oncogenic activity of the ras protein, said 
peptides having formulas as given below: 

cyclo [- R(l) R(2) THR ILE GLU ASP SER TYR ARG LYS GLN VAL VAL 
15 ILE ASP R{3) R(4) -] (I) 

cyclo [-R(l) R(2) VAL VAL ILE R{3) R{4)-] (II) 

cyclo [-R(l) R(2) TYR ARG GLU GLN ILE LYS ARG VAL LYS ASP SER ASP 
ASP VAL PRO R(3) R(4)-] (III) 

cyclo [~R(D R(2) LYS ARG VAL R(3) R<4)-] (IV) 

20 cyclo [-R(l) R(2) ILE LYS ARG VAL LYS ASP R{3) R(4)-] (V) 

cyclo [-R(l) R(2) GLY ASN LYS CYS ASP LEU ALA ALA ARG THR VAL GLU 
R(3) R(4)-] (VI) 

cyclo [-R(l) R(2) LYS CYS ASP LEU ALA R(3) R(4)-] (VII) 



cyclo [-R(l) R(2) CYS ASP LEU ALA ALA ARG THR R(3) R(4)-] (VIII) 
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cyclo [-R(l) R(2) ASP LEU ALA ALA R{3) R(4)-] (IX); and 

NH 2 
I 

O Asp 

H 2 N-Thr-Ile-GlU"Asp-Ser-Tyr-Arg-Lys-Gln-NH-CH 2 -C-Val-NH-CH (X) 

CH 2 CH 2 

or a physiologically acceptable salt thereof. 

10 in the aforementioned cyclized peptide formulas I -IX, R(l) 

R(2), R{3) and R(4) represent, in the most general case, any 
amino acid, such that they serve as amino acid residue linkers. 
Amino acid residue linkers are usually at least one residue and 
can be most often two to four residues, more often 1 to 10 

15 residues. Typical amino acid residues used for linking are 
tyrosine, cysteine, lysine, glutamic and aspartic acid. Most 
preferably [R(D, R(2)] and [R(3) , R(4)] independently are 
selected from either the groups [Glu, Gin, Asp, Asn] or [Lys, 
Arg, Orn] . 

20 The term - represents a bond between the carboxyl and amino 

termini by which R(l) and R{4) can be interconnected to each 
other via an lower alkyl, alkenyl or lower alkynyl group, but 
most preferably by a branched or unbranched methylene bridge of 
type --(CH 2 ) m --or - - (CH 2 ) m --M- - (CH 2 ) m , - - . In such an moiety, m 

25 and m' are integers from 1 to 6, inclusive, and preferably from 
1 to 3, inclusive; and M is NH, N[R(5)] , O, S or CH-R(5) , wherein 
R(5) is lower alkyl, cycloalkyl or aryl and is preferably methyl, 
ethyl, propyl, phenyl, X-phenyl, or heterocyclic, wherein X is 
C1-, CF 3 , F-, substituted at the o- , m- , or p- positions on the 

30 phenyl group. M can contain a part of another diamino acid 
within the same peptide, e.g., the omega amino group of the one 
residue can be so linked to such an unnatural amino acid residue 
in a terminal residue. 

Furthermore, any amino acid in the cyclized peptide 

35 sequences (I)-(X) recited herein may be replaced with its D- 
analogue, insofar as not more than 50% of the total amino acids 
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are so replaced. Similarly, a homologous conservative 

substitution for any amino acid is within the bounds of the 
present invention. Conservative substitutions include Glu for 
Asp, Gin for Asn and Val for lie, among others, as well-known to 
5 the art. Depending on the applications for which the peptides 
according to the invention are intended, it is also possible to 
intercalate between several amino acids, or even between all the 
amino acids, of the peptides defined above, dextrorotatory amino 
acids, and in particular dextrorotatory phenylalanine or 

10 dextrorotatory tryptophan, capable of preventing the action of 
the degradative enzymes in the cell environment and thus of 
increasing their activity. Another modification in this sense 
consists in replacing certain amino acids, for example of the 
proline type, by D- tryptophan . 

15 In addition, a subject polypeptide can differ, unless 

otherwise specified, from any of the natural sequences shown 
herein above by the sequence being modified by terminal -NH 2 
acylation, e.g., acetylation,* or by terminal -carboxylamidat ion, 
e.g., with ammonia, alkylamines, and the like. 

20 The placement of hydrophobic amino acid residues is highly 

dependent on the peptide sequence. For example, for the 3 5-47 
peptide sequence, corresponding to amino acids 35-47 of SEQ ID 
NO: 5, there is a distinct hydrophobic region for the amino acid 
residues corresponding to amino acids 44-46 of SEQ ID NO : 5 . The 

25 bridge in Compound (X) occurs at what corresponds in structure 
to amino acids 44-46 in SEQ ID NO : 5 . It is possible to extend 
this hydrophobic segment without sacrificing activity. For 
example, the carboxyl terminal Asp residue can be replaced with 
one or more hydrophobic residues such as Val or lie, and the 

30 result is greater efficiency in crossing cell membranes. 

Short half -lives of peptides, a major problem, can be at 
least partially extended by the addition of D-amino acids to 
either or both of the amino and carboxyl terminal ends of the 
peptide. These D-amino acid residues block the action of exo- 

3 5 proteases that degrade peptides from their amino or carboxyl 
ends. In addition, the cyclization of the peptide further 
renders the peptide less susceptible to proteolysis. 
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Recent advances in the field of peptides have been directed 
towards the stabilization of these peptides against enzymatic or 
hydrolytic degradation. It would be extremely valuable to 
stabilize these peptides from degradation by proteolytic enzymes 
5 in order to improve their pharmacokinetic properties. Enhanced 
resistance to enzymatic degradation would increase the usefulness 
of these peptides as therapeutic agents. However, since they 
only exhibit short half lives in vivo, large amounts of such 
peptides must typically be administered to a subject in order to 

10 achieve the desired effect. Alternatively, smaller quantities 
may be prescribed to an individual, but more frequent dosages 
would be required to achieve the same level of potency. 

It is further well-known to those normally skilled in the 
art that it is possible to replace peptides with peptidomimetics . 

15 Peptidomimetics are generally preferable as therapeutic agents 
to peptides owing to their enhanced bioavailability and relative 
lack of attack from proteolytic enzymes. The present inventors 
have used the techniques of molecular modeling supra, to design 
a peptidomimetic which mimics the critical beta-bend aspects of 

20 the peptide corresponding in sequence to amino acids 96-110 of 
SEQ ID NO: 5 (p21 ras) . The bend structure occurs at amino acids 
102-103 in the p21 ras protein. These residues have been 
implicated in the binding of ras p21 to SOS. 

Peptidomimetric compounds which inhibit the oncogenic or 

25 transforming activity of the p21 ras protein are provided by the 
compounds of Structure I : 
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wherein the sidechain R attached at the carbon atom numbered 6 
on the sterol nucleus can be NH-CH 2 -CH 2 NH 3 + , alkyl amino, 
arylamino, or aralkylamino group, and wherein the sidechain 
attached at the carbon number 3 can be replaced with -0-C(=0)- 
- (CH 2 ) m - -COOH, where m is an integer from 1 to 6, inclusive, 
preferably from 1 to 3, inclusive, and more preferably 2, and one 
of x and y at each position independently, can be one H, a small 
alkyl group of C x to C 3 , preferably C 1 ; a halogen, preferably F, 
or an amino group where the other of one of x and y is H. 
Preferably, each of x and y is H. 

Without wishing to be bound by any particular theory, the 
structure believed to be the optimally designed inhibiting 
peptidomimetic is illustrated below in Structure II: 




C=0 NH 

* OOC- CH 2 CH 2 

CH 2 

+ I 
NH 3 

STRUCTURE 2 



The instant invention comprises novel peptides of medicinal 
importance most particularly for the treatment of adenocarcinomas 
of the colon, pancreatic carcinomas, neuroblastomas, and other 
cancers of undefined germ cell origin which express the 
transformed sequence of the ras protein. These peptide sequences 
were unexpectedly obtained by the use of molecular dynamic 
simulations on ras p21 to define which domains of the protein 
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were most flexible and were thus most important in interacting 
with target proteins upstream and downstream from ras. These 
peptides are identified by the following amino acid sequences: 
Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-Val-Val-Ile-Asp (SEQ ID 
NO: 6) , Val-Val-Ile, Tyr-Arg-Glu-Gln- He -Lys- Arg- Val -Lys -Asp-Ser- 
Asp-Asp-Val-Pro (SEQIDNO:7), Lys-Arg-Val, He -Lys -Arg- Val - Lys - 
Asp (SEQ ID NO:l), Gly-Asn-Lys-Cys-Asp-Leu-Ala-Ala-Arg-Thr-Val- 
Glu (SEQ ID NO:8), Lys-Cys-Asp-Leu-Ala (SEQ ID NO:2), Cys-Asp- 
Leu-Ala-Ala-Arg-Thr (SEQ ID NO:9), and Asp-Leu-Ala-Ala (SEQ ID 
NO:4) . 



Including the cyclic analogues of the above peptides, 
namely: 

cyclo [- R(l) R(2) Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-Val-Val- 
Ile-Asp-R(3) R(4)-] (I); 

cyclo [-R(l) R(2) Val-Val-Ile-R(3) R(4)-] (II); 

cyclo [-R(l) R(2) Tyr-Arg-Glu-Gln- Ile-Lys-Arg-Val-Lys-Asp-Ser- 
Asp-Asp-Val-Pro-R(3) R(4)-] (III); 

cyclo [-R(l) R(2) Lys-Arg-Val R(3) R(4)-] (IV); 

cyclo [-R(l) R(2) Ile-Lys-Arg-Val-Lys-Asp R(3) R(4)-] (V) ; 

cyclo [-R(l) R(2) Gly-Asn-Lys-Cys-Asp-Leu-Ala-Ala-Arg-Thr-Val-Glu 
R(3) R(4)-] (VI); 



cyclo [-R(l) R(2) Lys-Cys-Asp-Leu-Ala R(3) R(4)-] (VII); 

cyclo [-R(l) R(2) Cys-Asp-Leu-Ala-Ala-Arg-Thr R(3) R(4)-] (VIII); 

cyclo [-R(l) R(2) Asp-Leu-Ala-Ala R (3) R(4)-] Z (IX); and 
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Asp 



H 2 N-Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-3STH-CH 2 -C-Val-NH-CH (X) 



CH, 



or a physiologically acceptable salt thereof. 

Wherein for cyclized peptide formulas designated by (I) - (IX) 

10 hereinabove, R(l) R(2), R(3) and R(4) represent, in the most 
general case, any amino acid, such that they serve as amino acid 
residue linkers. Amino acid residue linkers are usually at least 
one residue and can be most often two to four residues, more 
often 1 to 10 residues. Typical amino acid residues used for 

15 linking are tyrosine, cysteine, lysine, glutamic and aspartic 
acid, or the like. Most preferably [R ( 1 ) , R(2)] and [R(3), 
R(4)] independently are selected from either the groups [Glu, 
Gin, Asp, Asn] or [Lys , Arg, Orn] . 

The symbol - represents a bond between the carboxyl and 

2 0 amino termini by which R(l) and R(4) can be interconnected to 
each other via an lower alkenyl or lower alkynyl group, but most 
preferably by a branched or unbranched methylene bridge of type 
--(CH 2 ) m --or (CH 2 ) ra --M-- (CH 2 ) m ,-- . In such an moiety, m and m' 
are integers from 1 to 6 and preferably from 1 to 3 ; and M is NH, 

25 N[R{5)], O, S CH-R(5) or does not exist, wherein R(5) is lower 
alkyl, cycloalkyl or aryl and is preferably methyl, ethyl, 
propyl, phenyl, X-phenyl, or heterocyclic, wherein X is C1-, CF 3 - 
, F-, substituted at the o-, m- , or p- positions on the phenyl. 
M can contain a part of another diamino acid within the same 

30 peptide, e.g., the omega amino group of the one residue can be 
so linked to such an unnatural amino acid residue in a terminal 
residue . 

Furthermore, any amino acid in the sequences provided may 
be replaced with its D-analogue, insofar as not more than 50% of 
35 the total amino acids are so replaced. Conservative 
substitutions include Glu for Asp, Gin for Asn and Val for lie, 
among others, as is well known to those of ordinary skill in the 
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art. Similarly, a homologous conservative substitution for any 
amino acid is within the bounds of the present invention. 
Depending on the applications for which the peptides according 
to the invention are intended, it is also possible to envisage 
intercalating between several amino acids, or even between all 
the amino acids, of the peptides defined above, dextrorotatory 
amino acids, and in particular dextrorotatory phenylalanine or 
dextrorotatory tryptophan, capable of preventing the action of 
the degradative enzymes in the cell environment and thus of 
increasing their activity. Another modification in this sense 
consists in replacing certain amino acids, for example of the 
proline type, by D- tryptophan. 

In addition, a subject polypeptide can differ, unless 
otherwise specified, from the natural sequences shown above by 
the sequence being modified by terminal -NH 2 acylation, e.g., 
acetylation, or by terminal -carboxylamidat ion, e.g., with 
ammonia, alkylamines, and the like. 

The instant invention also comprises a method of use of the 
peptides supra for the treatment of adenocarcinomas of the colon, 
pancreatic carcinomas, neuroblastomas, and other cancers of 
undefined germ cell origin which express the transformed sequence 
of the ras protein. 

It is also an object of the present invention to provide 
peptides and cyclized peptide homologs from the sequences listed 
in SEQ ID NOS: 10-52. 

The amino acid residues described herein are preferred to 
be in the n L" isomeric form. However, residues in the "D" 
isomeric form can be substituted for any L-amino acid residue, 
as long as the desired functional property is retained by the 
polypeptide. NH 2 refers to the free amino group present at the 
amino terminus of a polypeptide. COOH refers to the free 
carboxyl group present at the carboxy terminus of a polypeptide. 
In keeping with standard polypeptide nomenclature described in 
J. Biol. Chem. 243:3552-3559 (1969) and adopted at 37 C.F.R. 
1.822(b) (2)), The list of variable amino acids, capable of 
participating in the composition of this peptide is as follows: 
Y, Tyr, tyrosine; G, Gly, glycine; F, Phe , phenylalanine; M, Met, 
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methionine; A, Ala, alanine; S, Ser, serine; I, He, isoleucine; 
L, Leu, leucine; T, Thr, threonine; V, Val, valine; P, Pro, 
proline; K, Lys , lysine; H, His, histidine; Q, Gin, glutamine; 
E, Glu, glutamic acid; W, Trp, tryptophan; R, Arg, arginine; 
D, Asp, aspartic acid; N, Asn, asparagine; C, Cys, cysteine. 

Amino acid residue sequences are presented herein in the 
conventional left- to-right direction of amino- terminus to 
carboxy- terminus . In addition, the phrase "amino acid residue" 
is broadly defined to include the amino acids listed hereinabove, 
and modified and unusual amino acids, such as those listed in 3 7 
C.F.R. 1.822(b)(4), incorporated herein by reference. 
Furthermore, it should be noted that a dash at the beginning or 
end of an amino acid residue sequence indicates either a peptide 
bond to a further sequence of one or more amino acid residues or 
a covalent bond to an amino or hydroxyl end group. 

Polypeptide and peptide are terms used interchangeably 
herein to designate a linear series of amino acid residues 
connected one to the other by peptide bonds between the 
alpha-amino and carboxy groups of adjacent residues. 

Protein is a term used herein to designate a linear series 
of greater than about 20 amino acid residues connected one to the 
other as in a polypeptide. 

The term synthetic peptide refers to a chemically produced 
chain of amino acid residues linked together by peptide bonds 
that is free of naturally occurring proteins and fragments 
thereof. The term peptide encompasses linear and cyclic 
peptides . 

(D,L), (D) , or (L) preceding the amino acid designation 
means that this amino acids exists in that specific isomeric 
form, i.e. (D,L) Phe means that the amino acid phenylalanine 
exists as a racemic mixture; (D) Phe or D-Phe means that the 
amino acid phenylalanine exists as the D-stereoisomer or implied 
R configuration; (L) Phe means that the amino acid phenylalanine 
exists as the L stereoisomer or implied S configuration. 

Alkyl as used herein means methyl, ethyl, n-propyl, 
isopropyl , n-butyl , sec-butyl , isobutyl , tert-butyl , n-pentyl , 
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neopentyl, 2 -methylbutyl , 1-methylbutyl , 1 -ethylpropyl , 

1. 1- dimethylpropyl, n-hexyl, 1 -methylpentyl , 2 -methylpentyl ( 
3-methylpentyl, 4 -me thylpentyl , 3 , 3 - dime t hy lbuty 1 , 

2.2- dimethylbutyl, 1, 1-dimethylbutyl, 2 -ethylbutyl , 1-ethylbutyl , 

1. 3- dimethylbutyl, n-heptyl, 5 -methylhexyl , 4 -methylhexyl , 
3-methylhexyl, 2 -methylhexyl , 1 -methylhexyl , 3 -e thylpentyl , 

2 - ethylpentyl , 1 - e thylpentyl , 4 , 4 - dime t hy lpen t y 1 , 
3, 3 -dime thylpentyl, 2 , 2 -dime thylpentyl , 1 , 1 -dime thylpentyl , 
n-octyl, 6-methylheptyl, 5 -methylheptyl , 4 -methylheptyl , 

3 - methylheptyl, 2 -methylheptyl, 1 -methylheptyl , 1 -ethylhexyl , 
1-propylpentyl, 3 -ethylhexyl , 5 , 5 - dime t hylhexyl , 

4.4- dimethylhexyl, 2 , 2 -diethylbutyl , 3 , 3 -diethylbutyl , 
1 -methyl - 1 -propylbutyl . 

Cycloalkyl refers to a hydrocarbon ring having from 3 to 7 
carbon atoms, inclusive. Examples of cycloalkyl groups are 
cyclopropyl, cyclopentyl, cycloheptyl, cyclooctyl, cyclononyl, 
and the like. 

The term aryl refers to aromatic groups which have at least 
one ring having a conjugated pi electron system and includes 
carbocyclic aryl, heterocyclic aryl aralkyl, and biaryl groups, 
all of which may be optionally substituted. 

Heterocyclic groups means groups having from 1 to 3 
heteroatoms as ring atoms in the aromatic ring and the remainder 
of the ring atoms carbon atoms. Suitable heteroatoms include 
oxygen, sulfur, and nitrogen, and their heterocyclic compounds 
can include furanyl, thienyl, pyridyl , pyrrolyl , N- lower alkyl 
pyrrolo, pyrimidyl, pyrazinyl, imidazolyl , and the like, all 
optionally substituted. 

Substituted heterocyclic refers to any heterocyclic aryl 
group substituted by a alkyl, aryl, cycloalkyl, halo, sulfonate, 
or trif luoromethyl group. 

The term alkyl amino refers to the groups - - NRR ' wherein 
respectively, (a) R is alkyl and R' is hydrogen or alkyl; (b) R 
is aryl and R' is hydrogen or aryl, (c) R is cycloalkyl and R' 
is hydrogen or alkyl, (d) R is hydrogen and R' is itself linear 
aminoalkyl, (e) R is alkyl and R' is itself linear aminoalkyl . 
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The term aminoalkyl refers to the groups - ( CH 2 ) m - NRR ' , 
wherein m is an integer from 1 to 6, inclusive and -NRR' is alkyl 
amino, as defined supra. 

Halo encompasses f luoro, chloro, bromo and iodo. 
5 The phrase protecting group, as used herein, means 

substituents which protect the reactive functional group from 
undesirable chemical reactions. Examples of such protecting 
groups include esters of carboxylic acids, ethers of alcohols and 
acetals and ketals of aldehydes and ketones. 

10 The phrase N-protecting group or N-protected as used herein 

means those groups intended to protect the N- terminus of an amino 
acid or peptide, to protect an amino group against undesirable 
reactions during synthetic procedures and includes, but is not 
limited to, sulfonyl, acetyl, pivaloyl, t-butyloxycarbonyl (Boc) , 

15 carbonylbenzyloxy (Cbz) , benzoyl and an L- or D-aminoacyl 
residue, which may itself be N-protected similarly. The 
deprotection conditions for the above protecting groups 
necessarily vary with the choice of protecting group. Thus, for 
example, an acyl group such as an alkanoyl , alkoxycarbonyl or an 

20 aroyl group may be removed for example, by hydrolysis with a 
suitable base such as an alkali metal hydroxide, for example 
lithium or sodium hydroxide. Alternatively an acyl group such 
as a t-butoxycarbonyl group may be removed, for example, by 
treatment with a suitable acid such as hydrochloric, sulfuric or 

25 phosphoric acid or trif luoroacetic acid and an 
arylmethoxycarbonyl group such as a benzyloxycarbonyl group may 
be removed, for example, by hydrogenation over a catalyst such 
as palladium-on-charcoal . 

The phrase COOH-protecting group or carboxyl-protecting 

3 0 group is, an esterifying group, for example an alkyl group 
(especially methyl or ethyl) or an arylmethyl group (especially 
benzyl) . The deprotection conditions for the above protecting 
groups necessarily vary with the choice of protecting group. 
Thus, for example, an esterifying group such as an alkyl or 

3 5 arylmethyl group may be removed, for example, by hydrolysis with 
a suitable base such as an alkali metal hydroxide, for example 
lithium or sodium hydroxide. Alternatively an esterifying group 
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such as an arylmethyl group may be removed, for example, by 
hydrogenation over a catalyst such as palladium-on-charcoal using 
either hydrogen or ammonium formate as a hydrogen source by 
methods well-known to those skilled in the art. 
5 Electrolyte means a solution that has sufficient acid 

strength to render a basic starting material essentially 
protonated . 

Chemical derivative refers to a subject polypeptide having 
one or more residues chemically derivatized by reaction of a 

10 functional side group. Such derivatized molecules include for 
example, those molecules in which free amino groups have been 
derivatized to form amine hydrochlorides, p- toluene sulfonyl 
groups, carbobenzoxy groups, t -butyl oxycarbonyl groups, 
chloroacetyl groups or formyl groups. Free carboxyl groups may 

15 be derivatized to form salts, methyl and ethyl esters or other 
types of esters or hydrazides. Free hydroxyl groups may be 
derivatized to form O-acyl or 0-alkyl derivatives. The imidazole 
nitrogen of histidine may be derivatized to form N- imidazolyl - 
benzylhistidine . Also included as chemical derivatives are those 

20 peptides which contain one or more naturally occurring amino acid 
derivatives of the twenty standard amino acids. For examples, 
4-hydroxyproline may be substituted for proline; 5 -hydroxylysine 
may be substituted for lysine; 3 -methylhistidine may be 
substituted for histidine; homoserine may be substituted for 

25 serine; and ornithine may be substituted for lysine. 

Polypeptides of the present invention also include any 
polypeptide having one or more additions and/or deletions or 
residues relative to the sequence of a polypeptide whose sequence 
is shown herein, so long as the requisite activity is maintained. 

30 As used herein, fragment means any subject peptide or 

polypeptide having an amino acid residue sequence shorter than 
that of a peptide or polypeptide whose full length amino acid 
residue sequence is shown herein. 

A pharmaceutical^ acceptable salt is one which is prepared 

35 by contacting a compound of formulas (I) - (X) according to the 
specifications therein with an acid whose anion is generally 
considered suitable for human consumption. Examples of 
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pharmacologically acceptable acid addition salts include the 
hydrochloride, hydrobromide , hydroiodide, sulfate, phosphate, 
acetate, propionate, lactate, maleate, malate, succinate, and 
tartrate salts. All of these salts may be prepared by 
5 conventional means by reacting, for example, the appropriate acid 
with the corresponding compound of structure of Formulas (I) - 
(X) . 

Unless otherwise indicated, the preparation methods 
disclosed herein result in product distributions which include 

10 all possible structural isomers. It is understood that 
physiological response may vary according to stereochemical 
structure. The isomers may be separated by conventional means 
such as fractional crystallization or High Pressure Liquid 
Chromatography (HPLC) . Briefly, the absolute configuration of 

15 a compound relates to how its substituents are oriented in space 
about a central atom. This notion becomes significant when 
coupled with the rigors of chirality. Chirality involves the 
identity of the substituents ' about that central atom. Thus, in 
general, a compound is said to be chiral when four distinctly 

20 different groups are bound to a central carbon atom. These 
groups may be spatially aligned in more than one manner without 
repeating their individual orientations. That is, a chiral 
compound may exhibit a mirror image which is also chiral . These 
mirror images are termed meso configurations, and are each 

25 absolute configurations of a chiral compound. 

Pharmaceutical compositions according to the present 
invention comprise one or more peptides and/or peptidomimet ics 
of the invention in association with a pharmaceut ically 
acceptable carrier or excipient, adapted for use in human or 

30 veterinary medicine. The compositions may contain from 0.001-99% 
of the active material. Such compositions may be presented for 
use in conventional manner in admixture with one or more 
physiologically acceptable carriers of excipients. The 
pharmaceutical compositions according to the invention may also 

35 contain other active ingredients such as antimicrobial agents, 
or preservatives. The compositions may optionally further 
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contain one or more other therapeutic agents which may, if 
desired, be a chemotherapeutic antiviral agent. 

Pharmaceutically acceptable salts of the peptides of this 
invention may be formed conventionally by reaction with an 
5 appropriate acid. The addition salts so formed from addition by 
acid may be identified by hydrochloric, hydrobromic, phosphoric, 
acetic, fumaric, salicylic, citric, lactic, mandelic, tartaric, 
oxalic, methanesulf onic , and the like. 

Thus, the peptides and peptidomimetics according to the 

10 present invention may be formulated for oral, buccal, parenteral, 
topical or rectal administration. In particular, these peptides 
and peptidomimetics may be formulated for injection or for 
infusion and may be presented in unit dose form in ampoules or 
in multidose containers with an added preservative. The 

15 compositions may take such forms as suspensions, solutions, or 
emulsions in oily or aqueous vehicles, and may contain 
formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may be 
in powder form for constitution with a suitable vehicle, e.g. 

20 sterile, pyrogen- free water, before use. 

The present invention further provides a process for 
preparing a pharmaceutical composition which comprises bringing 
a peptide and/or peptidomimetic of the invention into association 
with a pharmaceutically acceptable excipient or carrier. 

25 For administration by injection or infusion, the daily 

dosage as employed for treatment of an adult human of 
approximately 70 kg body weight will range from 0.01 mg to 10 mg 
of each active ingredient, preferably 0.1 to 5 mg, which may be 
administered in 1 to 4 doses, for example, depending on the route 

30 of administration and the condition of the patient. The dosage 
of the peptide used in the treatment will vary, depending on the 
seriousness of the disorder, the weight of the patient, the 
relative efficacy of the peptide and the judgment of the treating 
physician. However, suitable unit dosages in humans may be 

35 between about 0.05 mg to about 100 mg . For example, a unit 
dosage may be from between about 0.2 mg to about 50 mg . Such a 
unit dosage, described hereinabove, may be administered more than 
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once a day, e g., two or three times a day. Thus, the total 
daily dosage is in the range of about 0.01 mg to 10 mg/kg . Such 
therapy may extend for several weeks, in an intermittent or 
uninterrupted manner, until the patient's symptoms are 
eliminated . 

The present invention also provides pharmaceutical 
compositions which comprise a pharmaceut ically effective amount 
of the one or more peptides and/or peptidomimetics of this 
invention, or pharmaceutically acceptable salts thereof, and, 
preferably, a pharmaceutically acceptable carrier or adjuvant. 
Therapeutic methods of this invention comprise the step of 
treating patients in a pharmaceutically acceptable manner with 
those peptides or compositions. Such compositions may be in the 
form of tablets, capsules, caplets, powders, granules, lozenges, 
suppositories, reconstitutable powders, or liquid preparations, 
such as oral or sterile parenteral solutions or suspensions. 

In order to obtain consistency of administration, it is 
preferred that a composition* of the invention is in the form of 
a unit dose. The unit dose presentation forms for oral 
administration may be tablets and capsules and may contain 
conventional expedients. For example binding agents, such as 
acacia, gelatin, sorbitol, or polyvinylpyrrolidone ; fillers, such 
as lactose, sugar, maize-starch, calcium phosphate, sorbitol or 
glycine; tabletting lubricants such as magnesium stearate; 
disintegrants, such as starch, polyvinylpyrrolidone, sodium 
starch glycolate or microcrystalline cellulose; or 
pharmaceutically acceptable wetting agents such as sodium lauryl 
sulfate . 

The solid oral compositions may be prepared by conventional 
methods of blending, filling, tabletting, or the like. Repeated 
blending operations may be used to distribute the active agent 
throughout those compositions employing large quantities of 
fillers. Such operations are, of course, conventional in the 
art. The tablets may be coated according to methods well-known 
in normal pharmaceutical practice, in particular with an enteric 
coating . 
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Oral liquid preparations may be in the form of emulsions, 
syrups, or elixirs, or may be presented as a dry product for 
reconstitution with water or other suitable vehicle before use. 
Such liquid preparations may or may not contain conventional 
5 additives. For example suspending agents, such as sorbitol, 
syrup, methyl cellulose, gelatin, hydroxyethylcellulose , 
carboxymethylcellulose, aluminum stearate gel, or hydrogenated 
edible fats; emulsifying agents, such as sorbitan monooleate or 
acacia; non-aqueous vehicles (which may include edible oils) , 

10 such as almond oil, fractionated coconut oil, oily esters 
selected from the group consisting of glycerine, propylene 
glycol, ethylene glycol, and ethyl alcohol; preservatives, for 
instance methyl para-hydroxybenzoate , ethyl para-hydroxybenzoate , 
n-propyl parahydroxybenzoate, or n-butyl parahydroxybenzoate or 

15 sorbic acid; and, if desired, conventional flavoring or coloring 
agents . 

For parenteral administration, fluid unit dosage forms may 
be prepared by utilizing the peptide and a sterile vehicle, and, 
depending on the concentration employed, may be either suspended 

20 or dissolved in the vehicle. In preparing solutions, the 
peptides of this invention may be dissolved in water, whereas 
opiates used heretofore showed only marginal solubility in 
aqueous media or physiological fluids. Once in solution, the 
peptide may be injected and filter sterilized before filling a 

25 suitable vial or ampoule and subsequently sealing the carrier or 
storage package. Adjuvants, such as a local anaesthetic, a 
preservative or a buffering agent, may be dissolved in the 
vehicle prior to use. Stability of the pharmaceutical 

composition may be enhanced by freezing the composition after 

30 filling the vial and removing the^ water under vacuum, e.g., 
freeze drying the composition. Parenteral suspensions may be 
prepared in substantially the same manner, except that the 
peptide should be suspended in the vehicle rather than being 
dissolved. A surfactant or wetting solution may be 

35 advantageously included in the composition to facilitate uniform 
distribution of the peptide. 
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The stability of the peptides and cyclized peptides of the 
present invention exceeds that of naturally occurring peptides 
if substitution is made with D-amino acids in at least 20%, but 
not more than 50%, of those residues which are naturally present 
5 in the (L) configuration. Without being bound by theory, we 
believe that the increased resistance to enzymatic degradation 
over of the peptides of the present invention as compared to 
natural peptides is due to the presence of D-amino acids in the 
peptides. This switch from L to D amino acids neutralizes the 

10 digestion capabilities of many of the ubiquitous peptidases found 
in the digestive tract. Alternatively, the enhanced stability 
of the peptides of this invention may also be the result of the 
introduction of modifications of traditional peptide linkages. 
For example, the introduction of a cyclic ring within the peptide 

15 backbone may confer enhanced stability in order to circumvent the 
effect of many proteolytic enzymes known to digest small peptides 
in the stomach or other digestive organs and in serum. 

The compounds of the " present invention are initially 
synthesized by either solution or by solid phase techniques. 

20 Specific exemplary syntheses are described in the examples 
hereinbelow. The peptides of this invention may be prepared by 
initially reacting a first appropriately protected amino acid 
with a second appropriately protected amino acid in an organic 
solvent inert to the reactants, in the presence of a suitable 

25 peptide coupling agent according to the following scheme: 

Z (l)NH-AA(l) -COOH + coupling agent + NH 2 - AA ( 2 ) - COOZ ( 2 ) = 
Z (1) -NH-AA(l) -CONH-AA{2) -COOZ (2) 
wherein Z(l) is a suitable nitrogen protecting group and Z(2) is 
a suitable carboxyl protecting group and AA represents any 

30 natural or unnatural amino acid residue. The desired peptides 
may be prepared by utilizing the appropriate amino acids and 
repeating this reaction sequence as required until a peptide with 
three to ten amino acid residues has been prepared. A suitable 
deprotection method is then employed to remove specified or all 

3 5 of the remaining protecting groups or the peptide from the resin. 

The first appropriately protected amino acid and, for 
instance, an appropriately protected tyrosine may be reacted 
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together in the presence of a suitable peptide coupling agent in 
a suitably inert organic solvent with stirring, shaking, or 
agitation to form a protected tyrosine containing dipeptide. 
Introducing this dipeptide to appropriate protecting group 
5 removal conditions affords a selectively deprotected dipeptide 
which is well-suited for continued peptide synthesis. Contacting 
this mono-deprotected tyrosine containing dipeptide with an 
appropriately protected amino acid having a side chain 
represented as above, in the presence of a suitable peptide 

10 coupling agent in a suitably inert organic solvent with stirring, 
shaking, or agitation forms a protected tyrosine containing 
tripeptide. This method may be repeated as many times as 
necessary to achieve the desired peptide. 

The method of preparation for peptide synthesis requires 

15 specific functional groups to react with other substituents to 
link amino acid residues in a desired manner to form a peptide 
possessing a known and desired sequence of amino acid residues. 
Since amino acids possess at least two reactive functional 
groups, suitable protection, blocking, or masking of these groups 

20 is required to ensure that reaction will occur only at 
specifically desired sites. 

These protecting groups should be introduced to the moiety 
efficaciously while their removal should be performed under 
conditions which do not affect other portions of the molecule. 

25 In this manner, certain reactions and modifications may be 
performed on the amino acid, peptide, or other compound, with 
assurance that the protected functionality will not interfere 
with the desired reaction. Further, by choosing a protecting 
group that is sensitive and labile to certain reactive 

30 conditions, a reaction scheme may be outlined to advantageously 
utilize these characteristics to effectively remove the 
protecting group once the synthesis is complete. 

Both N-protecting groups and COOH-protecting groups (see 
definitions) may be used within the scope of this invention. A 

35 variety of protecting groups known in the field of peptide 
synthesis and recognized by conventional abbreviations therein, 
may be found in T. Greene, Protective Groups In Organic 
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Synthesis , Academic Press (1981) . Among the preferred protecting 
groups that may be utilized for suitable protection of reactive 
nucleophilic substituents include, for example, benzyl (Bz), 
carbobenzyloxy (Cbz) , t -butoxycarbonyl (Boc) , or 

5 9-f luorenylmethyloxy-carbonyl (Fmoc) . 

Coupling of amino acids, which may be the same or different 
as those described above, to yield small peptides in route to 
peptides comprised of greater numbers of amino acid residues may 
be accomplished by employing established techniques in the field 

10 of peptide chemistry. A broad range of suitable reactions are 
described in E. Gross and J. Meinhofer, The Peptides: Analysis, 
Synthesis , Biology ; Modern Techniques of Peptide and Amino Acid 
Analysis , John Wiley & Sons, (1981) and M. Bodanszky, Principles 
Of Peptide Synthesis , Springer-Verlag (1984) . The peptide 

15 coupling agents which may be used to assist condensation of amino 
and carboxylic acid moieties include 
N,N' -dicyclohexylcarbodiimide (DCC) , N, N' -carbonyl diimidazole 
(CDI) , 1 -hydroxy benzotriazole (HOBt) , ethyl chlorof ormate , 
benzyl chloroformate, 

20 1- (ethoxycarbonyl ) - 2 -ethoxy- 1 , 2 - dihydroquinoline (EEDQ) , 

benzotriazoyl-oxy-tris- (dimethyl) amino-phosphonium hexaf luoro 
phosphate (BOP) and the like. A preferred technique uses DCC as 
the coupling reagent. The DCC method may be used with or without 
catalytic additives such as 4 -dimethylaminopyridine (DMAP) , 

25 copper (II) chloride or HOBt to hasten the reaction and suppress 
the racemization of the desired compound. 

The DCC reaction is often performed at room temperature but 
may be carried out from about -78 °C to gentle reflux in a 
variety of solvents that are inert with respect to the reactants. 

3 0 The solvents are normally organic solvents which are polar and 
aprotic. Preferred solvents include, for example, 

dichloromethane , chloroform, diethyl ether, tetrahydrof uran 
(THF) , N, N' -dimethylf ormamide (DMF) , and the like. Particularly 
preferred solvents are dichloromethane and DMF. In general, the 

3 5 coupling reaction may be carried out at atmospheric pressure a 
temperature of -78 °C to reflux for a period of between 1 and 4 8 
hours. Preferably, the reaction is carried out at about -10° C 
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to 25° C with stirring, shaking or agitation, over a period of 
between 4 and 6 hours. 

Alternatively, synthesis may be achieved prepared using 
solid phase synthesis, such as that described by Merrifield, J 
5 Am. Chem. Soc . , 85, p 2149 (1964), although other equivalent 
chemical syntheses known in the art can also be used. 
Solid-phase synthesis is commenced from the C-terminus of the 
peptide by coupling a protected alpha-amino acid to a suitable 
resin as generally set forth in U.S. Pat. No. 4,244,946. 

10 As an example, lie protected by BOC is coupled to the a BHA 

resin using methylene chloride and dimethylf ormamide . Following 
the coupling of BOC- lie to the resin support, the alpha-amino 
protecting group is removed, as by using trif luoroacetic acid 
(TFA) in methylene chloride, TFA alone or with HC1 in dioxane . 

15 Preferably 50 volume % TFA in methylene chloride is used with 0-5 
weight % 1,2 ethanedithiol . The deprotection is carried out at 
a temperature between about 0° C and room temperature. Other 
standard cleaving reagents and conditions for removal of specific 
alpha-amino protecting groups may be used as described in 

20 Schroder & Lubke, The Peptides , pp 72-75 (Academic Press 1965) . 

After removal of the alpha-amino protecting group of lie, 
the remaining alpha-amino- and side chain-protected amino acids 
are coupled step-wise in the desired order to obtain the 
intermediate compound defined hereinbefore. As an alternative 

25 to adding each amino acid separately in the synthesis, some of 
them may be coupled to one another prior to addition to the solid 
phase reactor. The selection of an appropriate coupling reagent 
is within the skill of the art. Particularly suitable as 
coupling reagents are N, N' -dicyclohexyl carbodiimide (DCC) and 

30 N,N'-diisopropyl carbodiimide (DICI ) , or 

N-ethyl-N' - ( 3 -dimethylaminopropyl ) carbodiimide. Other 
activating reagents and their use in peptide coupling are 
described by Schroder & Lubke, supra, in Chapter III and by 
Kapoor (1970) J. Phar. Sci . 59 : 127 . 

35 Each protected amino acid or amino acid sequence is 

introduced into the solid phase reactor in about a fourfold 
excess, and the coupling is carried out in a medium of 
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dimethyl formamide : dichloromethane (1:1) or in DMF or 
dichloromethane alone. In instances where the coupling is 
carried out manually, the success of the coupling reaction at 
each stage of the synthesis is monitored by the ninhydrin 
5 reaction, as described by E. Kaiser et al . (1970) Anal. Biochem. 
3_4:5 95. In cases where incomplete coupling occurs, the coupling 
procedure is repeated before removal of the alpha -amino 
protecting group prior to the coupling of the next amino acid. 
The coupling reactions can be performed automatically, as on a 

10 Applied Biosystems automatic synthesizer. 

After the desired amino acid sequence has been completed, 
the intermediate peptide is removed from the resin support by 
treatment with a reagent, such as liquid hydrogen fluoride, which 
not only cleaves the peptide from the resin but also cleaves all 

15 remaining side chain protecting groups and the alpha-amino 
protecting group (unless it is an acyl group which is intended 
to be present in the final peptide) to obtain the peptide. When 
using hydrogen fluoride for cleaving, anisole or cresol and 
methylethyl sulfide are included in the reaction vessel as 

20 scavengers. When Met is present in the sequence, the BOC 
protecting group may be cleaved with trif luoroacetic acid 
(TFA) /ethanedithiol prior to cleaving the peptide from the resin 
to eliminate potential S-alkylat ion . 

All patents and publications referred to in the examples, 

25 and throughout the specification, are incorporated herein by 
reference, without admission that such is prior art. 

The following nonlimiting examples are provided to 
illustrate the invention. The skilled artisan will recognize 
that there may be substitutions and variations of the exemplified 

30 methods and compositions which are apparent and can be practiced 
without departing from the essence of the invention. 

EXAMPLES 

Example 1 . Peptide Synthesis 

The synthesis of the peptide of SEQ ID NO:l (Thr- Ile-Glu- 
35 Asp-Ser-Tyr-Arg-Lys-Gln-Val-Val-Ile-Asp) is conducted in a 



WO 97/10836 



PCT/US96/15098 



38 

stepwise manner on a MBHA hydrochloride resin, such as available 
from Bachem, Inc. (Torrance, CA) having a substitution range of 
about 0.1 to 0.5 mmoles/gm. resin. 

All equipment employed in the examples is commercially 
available. Unless otherwise indicated, all starting materials 
employed in the examples are commercially available. Sources for 
these materials include Sigma Chemical Co. (St. Louis, MO), 
Aldrich Chemical Co. (Milwaukee, WI) , Lancaster Synthesis 
(Windham, NH) , Fisher Scientific (Pittsburgh, PA), Boehringer 
Mannheim Biochemicals (Indianapolis, IN) , Fluka Chemical Corp. 
(Ronkonkoma, NY) and Chemical Dynamics Corp. (South Plainfield, 
NJ) . Most of the starting materials were obtained from Aldrich 
Chemical Co. (Milwaukee, WI) . 

All solvents used in the peptide preparations described 
herein, e.g. methylene chloride dichloromethane , 2-propanol, 
dimethylf ormamide (DMF) , and methanol, were Burdick and Jackson 
"distilled in glass" grade and used without additional 
distillation. Trif luoroacetic acid (TFA) , diisopropylethylamine 
(DIPEA) , piperidine (PIP) , dicyclohexylcarbodiimide (DCC) , 
1-hydroxybenzotriazole (HOBt) , and [benzotriazole - 1 -yl - 
oxy-tris (dimethyl) phosphonium hexaf luorophosphate] (BOP) were 
purchased from Chemical Dynamics Corp. and were "sequenal" grade 
purity. 1, 2-ethanedithiol (EDT) was purchased from Sigma 
Chemical Co. and used without further purification. All 
protected amino acids were of the L-conf igurat ion unless 
otherwise indicated and were obtained from Bachem (Torrance, CA) . 
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The synthesis is performed on an Applied Biosystems peptide 
synthesizer (Foster City, CA) using a suitable program, 
preferably as follows: 



STEP 


REAGENTS AND OPERATIONS 


MIX TIMES 


1 


Dichloromethane-80 ml. 


2 


2 


Methanol(MeOH) wash-30 ml. 


2 


3 


Dichloromethane-80 ml. 


3 


4 


50 percent TFA plus 5 percent 1 ,2-ethane- dithiol in 
dichloromethane -70 ml. 


2 


5 


Isopropanol wash-80 ml. 


2 


6 


TEA 12.5 percent in dichloromethane -70 ml. 


2 


7 


MeOH wash-40 ml. 


2 


8 


Dichloromethane wash-80 ml. 


3 


9 


Boc-amino acid (10 mmoles) in 30 ml. of either DMF or 
dichloromethane, depending upon the solubility of the 
particular protected amino acid, (1 time) plus DCC (10 
mmoles) in dichloromethane (reaction time 20-200 min) 





Note: All wash and mix times three minutes except where noted. 



15 Coupling of BOC-ASP(OBz) results in the substitution of 

about 0.35 mmol ASP per gram of resin. All solvents that are 
used are carefully degassed, preferably by sparging with an inert 
gas, e.g., helium or nitrogen, to insure the absence of oxygen. 

After deprotection and neutralization, the peptide chain is 

20 built stepwise on the resin. Generally, one to two mmol. of 
BOC-protected amino acid in methylene chloride is used per gram 
of resin, plus one equivalent of 2 M DCC in methylene chloride, 
for two hours. When BOC-Arg (Tos ) is being coupled, a mixture of 
50% DMF and methylene chloride is used. Bzl is used as the 

25 hydroxyl side-chain protecting group for Ser and Thr . 

p-nitrophenyl ester (ONp) can be used to activate the carboxyl end 
of Asn or Gin; for example, BOC-Asn(ONp) can be coupled overnight 
using one equivalent of HOBt in a 50% mixture of DMF and 
methylene chloride. The amido group of Asn or Gin is protected 
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by Xan when DCC coupling is used instead of the active ester 
method. 2-C1-CBZ is used as the protecting group for the Lys 
side chain. Tos is used to protect the guanidine group of Arg 
and the imidazole group of His, and the side-chain carboxyl group 
5 of Glu or Asp is protected by OBzl . 

To cleave and deprotect the resulting protected 
peptide-resin, it is treated with 1.5 ml anisole, 0.5 ml of 
methylethylsulf ide and 15 ml liquid hydrogen fluoride (HF) per 
gram of peptide-resin, first at -20 °C for 20 min and then at 0 

10 °C for 30 min. This reaction must be performed with great care 

owing to the highly toxic and corrosive nature of hydrogen 
fluoride. This reaction is performed in a commercially available 
teflon apparatus (Peninsula Research, Inc., Richmond, CA) . After 
complete elimination of HF under high vacuum using a KOH trap, 

15 the resin-peptide is washed alternately with dry diethyl ether 
and chloroform, and the peptides are then extracted with degassed 
2 N aqueous acetic acid and separated from the resin by 
filtration on a Hirsch funnel. 

The peptide is purified by gel permeation followed by 

20 preparative HPLC as described in Marki et al . (1981) J. Am. Chem. 

Soc. 103:3178; Rivier, et al. (1984) J. Chromatography 
288.: 303-328 ; and Hoeger, et al . (1987) BioChromatography 
2:134-142. The chromatographic fractions are carefully monitored 
by HPLC (see below) , and only the fractions showing substantial 

25 purity are pooled. 

To confirm that the desired sequence is achieved, the 
peptide is hydrolyzed in sealed evacuated tubes containing 
constant boiling HC1, 3 /il of thioglycol/ml and 1 nmol of Nle (as 
an internal standard) for 9 hours at 140 °C. Amino acid analysis 

30 of the hydrolysates using a Beckman 121 amino acid analyzer to 
determine amino acid ratios allows confirmation that the desired 
peptide structure has been obtained. 

Example 2 . Cvclization of Active Peptides 

Cyclization "traps" the bioactive conformation of the 
35 peptide by making the active conformation part of a ring system 
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that allows it much less conformational flexibility. In this 
procedure, aspartate or glutamate residues are introduced into 
the sequence either in place of non-essential amino acid residues 
or as added residues in the chain. The new peptide is then 
5 subjected to electro-oxidation in which the two residues are 
decarboxylated, in an intramolecular Kolbe electro-oxidative 
coupling reaction, resulting in the joining of their respective - 
CH 2 groups, forming a ring as shown in Fig. 1. This method has 
been used to make a cyclized /3-bend of the dipeptide, Pro-Gly, 

10 by placing a glutamic acid residue on the amino and carboxyl ends 
of this dipeptide and then performing the Kolbe electro-oxidation 
to form the tetra- (CH 2 ) -bridge . The Pro-Gly peptide, which has 
a variety of conformations in solution, when cyclized, was found 
to adopt the /S-bend structure uniquely ( Joran, A. , 

15 M Conformational^ restricted biologically active peptides, 
methods for their production and uses thereof," U.S. Patent No. 
5,3 64,851.) This method has been used quite recently to 
synthesize cyclized forms of the peptide vasopressin; these forms 
have been tested in an in vitro adenylate cyclase system and have 

20 been found to have prolonged half-lives and greater activity than 
the native peptide. Therefore, this cyclization procedure may 
result in enhanced peptide inhibition and in increased half -life. 
Introduction of the cyclizing rigidifying agent reduces the 
flexibility of the peptide and concurrently introduces non-polar 

25 aliphatic groups into the peptide (such as the tetra-methylene 
bridge shown in Scheme I) that help promote transport of the 
peptide through the cell membrane. 
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Scheme I illustrates an exemplary result of using 
electrolytic decarboxylation to cyclize peptides to trap them in 
their active conformations. Either two gultamate, two aspartate, 
or one glutamate and one aspartate residues are introduced in the 
chain represented by the wavy line. Under electro-oxidation 
shown here for two glutamate residues, a tetramethylene bridge 
forms cyclizing the region of the peptide that is to be held 
fixed in its bioactive conformation. The two connected residues 
shown constitute the diamino suberic acid moiety. 

It should be noted that, in the synthesis of this new 
peptide, there are possibly other aspartate and glutamate amino 
acid residues that can undergo the oxidative decarboxylation. 
To prevent these reactions from occurring, these Asp and Glu 
residues are protected as esters during the solid phase synthesis 
of the peptide. The free Glu residues at positions 44 and 46 are 
then allowed to undergo the cyclization reaction, after which the 
protected acid groups are then deprotected. 

This cyclization procedure can be performed on other regions 
of this peptide and on the other two active peptides. 

The electrooxidative coupling reaction used to prepare the 
cyclic peptides of the invention can be performed in a divided 
or an undivided cell such as a standard glass H-cell, as 
described in Organic Electrochemistry (2nd Ed.), M. Baizer and 
H. Lund, eds., Marcel Dekker, New York, 1983, Chap. 5, p 168. 
For large scale runs, the reaction can be carried out in a plate 
and frame flow cell as described in Techniqu e of Electroorganic 
Synthesis , Part III, N. Weinberg and B. Tilak, ed . , John Wiley 
6c Sons, New York, 1982, Chap. Ill, p 179. 

Cathode materials useful for the preparation of the 
compounds of the invention include, but are not limited to, high 
hydrogen overvoltage materials such as mercury, lead or cadmium. 
Anode materials include, but are not limited to, materials such 
as mercury, lead, graphite, or graphite paste, which are stable 
under electrolysis conditions. 
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The electrooxidative coupling can occur in aqueous, or 
aqueous organic electrolytes, comprising solutions of Bronsted 
acids, such as sulfuric, fluoroboric, and trif luoroacet ic acids. 
Any electrolyte may be selected that has sufficient acid strength 
to render a basic starting material protonated. A dilute 
solution of trif luoroacetic acid is most preferred. 

Although the preferred method of electrolysis to obtain the 
compounds of this invention takes place under constant current 
conditions, the oxidative coupling could also be performed using 
controlled potential electrolysis, as understood by those skilled 
in the art. Typical current densities are between 1 and 5000 
milliamps (mA) /cm 2 , preferably between 10 and 100 mA/cm 2 . The 
reaction is preferably carried out at a temperature in the range 
of about 0 °C to 37 °C, more preferably about 10 °C. 

A standard glass H-cell (200 ml volume, glass frit 
separator) was equipped with a mercury pool cathode 12 cm 2 area) , 
a magnetic stirrer, and a platinum foil anode. The cell 
reservoir was filled with 40 mM trif luoroacetic acid (110 ml) and 
placed in a water bath maintained at 10 °C. The catholyte was 
purged with nitrogen. The starting peptide (20 mg) was added to 
the catholyte and constant current electrolysis was initiated at 
0.1 A. The reaction progress was followed by HPLC and after 
passage of 1,060 coulombs, all the substrate had been consumed 
and the electrolysis was terminated. The catholyte was recovered 
and adjusted to pH 8 with NaOH . The pH- adjusted catholyte was 
extracted with chloroform (2 times 70 ml) . The extract was 
freeze dried and the resultant powdery material extracted with 
acetonitrile (HPLC grade) . This was filtered through a sintered- 
glass filter (fine porosity) and was reduced in volume on a 
rotary evaporator using a mechanical vacuum pump to a volume of 
2 ml. This material was purified by reversed-phase high pressure 
liquid chromatography using a Waters HPLC system with a 0.4 6 x 
0.25 cm column packed with 5 jxm C 18 silica, 300 A pore size. 
Buffer A is an aqueous 0.1% (vol/vol) trif luoroacetic acid 
solution (1.0 ml of TFA per 1000 Ml solution); Buffer B is 100% 
acetonitrile. The determination is run at room temperature with 
a gradient from 15.5% Buffer B to 75% Buffer B over a 30 min. 
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The flow rate is 2.2 ml per minute, and the retention time is 
25.0 min. 

The structure was confirmed by 30 0 MHz X H NMR, 13 C NMR , and 
electrospray mass spectroscopy. 

The amounts of the reactants and the conditions required to 
facilitate reaction and encourage efficient completion of the 
aforementioned Examples may vary widely. However, in general, 
the amounts of material employed to induce reaction in the 
processes discussed above will be substantially stoichiometric, 
unless otherwise specified. In the following examples, reaction 
concentrations are generally held at 0.1 M for the reactants, 
unless a higher concentration or dilution would be particularly 
useful for influencing the direction of a specific reaction. In 
practice, the amounts used will depend upon variations in 
reaction conditions and the nature of the reactants as readily 
apparent to one of ordinary skill in the art. 

In any of the methods described hereinabove, the desired 
products may be isolated from the reaction mixture by 
crystallization. Alternatively, chromatographic techniques 
including, but not limited to, normal phase, reverse phase, 
ion-exchange, affinity, or gel permeation, may be employed, as 
well as electrophoresis or extraction or other means. 

Example 3 . Oocyte Maturation Assay 

Using the method described in Chung et al . (1991) Anticancer 
Res. 11 : 1373-1378 , test peptides, cyclized peptides and/or 
peptidomimetics are injected into immature oocytes at various 
doses. The oocytes are co- injected with recombinant transforming 
ras p21 obtained from the National Cancer Institute of Japan. 
Alternatively, the oncogenic ras p21 can be prepared by the 
ordinary skilled artisan without the expense of undue 
experimentation as described in Chung et al . (1991) supra and in 
Chung et al . (1992) Exp. Cell. Res. 203:329-335 The maturation 
of the oocytes is evaluated microscopically at low power (20 X) , 
using a Nikon Diaphot microscope, for example. Percent 
inhibition is calculated based on comparisons with oocytes which 
are injected with 0.05 mg/ml oncogenic ras p21. 
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The following results were obtained using a dose of each 
peptide equivalent to an internal oocyte concentration of 50 nM: 



Peptide OR spoupnrp in 

Number 


Percent Inhibition of 
ras-lnduced Maturation 


6 


\ 28 


Val-Val-lle 


34 


7 


56 


Lys-Arg-Val 


22 


1 


76 


8 


92 


2 


38 


9 


65 


4 


22 



The peptidomimetics and cyclic peptides of the present 
invention will be similarly effective in inhibiting oocyte 
maturation in response to the oncogenic ras p21 protein, and in 
inhibiting oncogenesis. 
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Example 4 . Synthesis of Pept idomimetic p21 ras Inhibitor 

A representative pept idomimetic of the present invention is 
synthesized according to Scheme II, as further described 
hereinbelow . 

5 Scheme II 




Na, nPfOH 
reflux, 79% 



R = CH 3 CH(C^} 3 CH(CH 3 ) 2 




R' = tBuC^C C H 2 C O 

Steroid 1 was commercially available, and used without further 
purification. Mono- t-butylmalonate was prepared by literature 
methods [Brunwin, D.M. ; et al . (1971) J . Chem. Soc . C. 3756]. 
THF was distilled from sodium/benzophenone under Ar. Methylene 
10 chloride and ethanol (absolute) were distilled from CaH2 under 
Ar. Benzylchlorof ormate and n-propyl alcohol are commercially 
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available, and used without further purification or drying. All 
reactions performed under an atmosphere of Ar unless otherwise 
noted. 

5-a-3-/E3-hydroxy-6-oximinocholestane (2): Cholestanone 1 (1.00 
5 g. 2.5 mmol) , NaOAc (352 mg, 4.3 mmol) and hydroxylamine 
hydrochloride (197 mg, 2.8 mmol) were heated at reflux in 
absolute ethanol (15 ml) . The reaction was efficiently stirred 
under Ar for llh at reflux. The reaction was cooled, and the 
solvent removed on the rotary evaporator. The resulting white 

10 solid was dissolved in CHC1 3 , and washed once with brine (80 ml) . 

The organic layer was dried over Na 2 S0 4/ filtered, and 
concentrated in vacuo. The yield of crude material was 1.06 8 g 
of a lightly colored solid. The material was recrystallized from 
absolute ethanol (15.5 mL) to give 752 mg (72%) of white needles, 

15 mp = 202-204°C (dec). X H NMR (CDC1 3 ) : 63.60 (m, 1H) , 3.33 (dd, J 

= 4.37, 13.6 Hz, 1H) , 2.10-1.75 (broad m, 6H) , 1.70-1.45 (broad 
m, 5H) , 1.42-1.28 (broad m, 8H) , 1.14 (m, 9H) , 0.92 (d, J" = 6.87 
Hz, 3H) , 0.88 (d, J= 6.54 Hz, 6H) , 0.77 (s, 3H) , 0.67 (s, 3H) . 
13 C NMR (CDC1 3 ) : 6159.55, 70.71, 56.39, 55.91, 54.04, 49.26, 

20 42.60, 39.42, 39.20, 38.57, 35.92, 35.84, 35.63, 35.43, 31.24, 

30.40, 29.39, 27.87, 27.70, 23.80, 23.53, 22.51, 22.26, 21.17, 
18.36, 12.34, 11.79, IR (neat film): 3354, 2941, 1667, 1467, 
1065, 978 cm" 1 . 

5-of-6-a - amino - 3 - 0 - hydroxy cho 1 e s t ane (3): Oxime 2 (752 mg, 1.8 
25 mmol) was dissolved with good stirring in boiling n-propyl 
alcohol (32 mL) . The flask was removed from the oil bath, and 
small pieces of freshly prepared sodium wire (2.898 g, 126 mmol) 
were added at a rate sufficient to maintain the reflux. After 
the addition of the sodium was complete, the flask was lowered 
30 into the bath, and stirred at reflux for 2h (a thick, white crust 
forms) . The reaction was cooled to room temperature, and 
carefully quenched by slow, dropwise addition of water under an 
inert atmosphere. The quenched reaction was extracted twice with 
CHCI3 (50 mL) , and the extracts were washed with brine, dried 
35 over Na 2 S0 4 , filtered, and concentrated. The yield of a vanilla 
colored solid was 723 mg. The material was purified by flash 
chroma trography on silica in CHCl 3 /MeOH (3:2) to give 577 mg 
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(79%) of a vanilla colored solid. Alternatively, the product can 
be recrystallized from EtOAc . *H NMR (CDC1 3 ) : 63.59 (m, 1H) , 
2.60 (dd, J = 2. 87, 10.35, 20.73Hz, 1H) , 2.09 (d, *J= 11.79Hz, 
1H) , 1.99 (d, J= 12.94Hz, 1H) , 1.85 (m, 3H) , 1.78 - 1.45 (broad 
5 m, 10H) , 1.36 (m, 5H) , 1.24 - 0.96 (m, 9H) , 0.89 (m, 10H) , 0.83 

(s, 3H) , 0.77 (m, 2H) , 0.67 (s, 3H) . 

(benzyloxy carbony 1 ) - 5- af-6-a - amino - 3 - /3 -hydroxycholes tane (4) : 
The amine 3 (46 5 mg . 1.15 mmol) and anhydrous K 2 C0 3 (4 57 mg, 3.3 
mmol) were stirred in dry THF (4 mL) , and cooled to 0°C in an 

10 ice-water bath. Benzyl - chlorof ormat e (0.16 mL, 1.15 mmol) was 
added dropwise, and the reaction was stirred at 0°C for 15 min, 
then at room temperature for 6h. The reaction was diluted with 
water and poured into saturated NaHC0 3 (15 mL) . A thick, 
flocculent white precipitate formed. The aqueous mixture was 

15 extracted with CHC1 3 (20 mL) , and the aqueous layer was saturated 
with NaCl and extracted with CHC1 3 (5x2 0 mL) . The organic 
extracts were washed with brine (40 mL) , dried over Na 2 S0 4/ 
filtered, and concentrated. The yield of crude material was 64 9 
mg of a tan solid. The material was flash chromatographed on 

20 silica in Hex/EtOAc (1:1) , (using a little CHC1 3 to help dissolve 
material) to give 546 mg (88%) of a white solid. X H NMR (CDC1 3 ) : 
57.88 (m, 4H) , 7.23 (s, 1H) , 5.08 (d, d= 3.08Hz, 2H) , 4.86 (d, 
J = 9.46Hz, 1H) , 3.53 (m, 1H) , 3.50 (s, 1H; overlaps with the 
multiplet at 3.53), 2.00 (d, J = 12.35Hz, 2H) , 1.85 (m, 3H) , 1.53 

25 (m, 4H) , 1.49 - 1.22 (broad m, 9H) , 1.20 - 0.98 (broad m, 10H) , 

0.90 (m, 9H) , 0.87 (s, 3H) , 0.77 (m, 2H) , 0.66 (s, 3H) . IR (neat 
film): 3346 (broad), 1691, 1544, 1022 cm" 1 . 

N- (benzyloxy carbonyl) - 5- a- 6- ar - amino - 3 - /3 - mono - 1 - butyl malonyl 
cholestane (5): The amine 4 (144.4 mg, 0.27 mmol), mono-t-butyl 

30 malonate (107.7 mg, 0.67 mmol), and DMAP were dissolved in dry 
CH 2 C1 2 (1.2 mL) , and stirred efficiently at room temperature. 
DCC (67.4 dmg, 0.33 mmol) was added in one portion under a stream 
of Ar. The reaction was stirred at room temperature for 23h, 
then diluted with ether, and filtered through a pad of Celite (a 

35 white solid remains on the pad) . The filtrate was washed with 
10% citric acid (20 mL) , saturated NaHC0 3 (20 mL) , and brine (20 
mL) . The organic layer was dried over Na 2 S0 4 , filtered and 
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concentrated. The yield of crude material was 2 00 mg. The 
material was flash chromatography in Hex/EtOAc (6.7:1) to give 
168.7 mg (92%) of a yellow-gold residue X H NMR (CDC1 3 ): 67.36 
(s, 4H), 7.29 (s, IH), 5.08 (q, J= 12.25, 27.09Hz, 2H) , 4.70 
(m, IH), 4.38 (d, J = 9.56Hz, IH) , 3.27 (s, 2H) , 2.00 (m, 2H) , 
1.85 (m, 4H) , 1.54 (m, 3H) , 1.48 (s, 9H) , 1.35 (M, 8H) , 1.11 ( m ,' 
11H) , 0.89 9(m, 12H) , 0.75 (m, 2H) , 0.66 (s, 3H) . 

The itf-CBZ group of 5 was cleanly removed by hydrogenolysis 
using 10% palladium on carbon under a hydrdogen atmosphere to 
give 6 in 93% yield. Trials employing 1 , 4 -cyclohexadiene as the 
hydrogen source [Felix et al . (1978) J". Org. chem. 43:4194] 
produced 6 in only 23% yield, even when a large excess of the 
diene was used. Longer reaction time did not improve the yield. 
5-a-6-a-amino-3-^-mono-t-butyl malonyl cholestane (6): 5 (200.7 
mg, 0.3 mmol) and 10% palladium on carbon (194 mg) were stirred 
vigorously in abs . EtOH (1.8 mL) . The system was flushed with a 
balloon of hydrogen, and an atmosphere of hydrogen was maintained 
by 2 balloons of hydrogen. The reaction was stirred for 20h at 
room temperature, then vented with Ar, and suet ion- filtered 
through a tighly packed pad of Celite . The flask and filter cake 
were thoroughly washed with 1:1 EtOH/THF. The filtrate was 
concentrated in vacuo to give 167.8 mg of material. The crude 
product was flash chromatographed first in Hex/EtOAc (3:2) to 
elute off minor byproducts, then CHCl 3 /MeOH (9:1) to give the 
product in a yield of 152.3 mg (93%) of a golden residue. *H NMR 
(CDC1 3 ) : 64.76 (m, IH) , 3.27 (s, 2H) , 2.60 (m, IH) , 2.22 (m, IH) , 
2.05 - 1.71 (broad m, 6H) , 1.51 (s, 3H) , 1.49 (s, 9H) , 1.34 
(broad m, 8H) , 1.24 - 0.97 (broad m, 12H) , 0.92 (d, J" = 6.55Hz, 
3H), 0.88 (d, J = 6.64Hz, 6H) , 0.85 (s, 3H) , 0.75 (m, 2H) , 0.67 
(s, 3H) , IR (neat film): 3368, 2946, 2868, 1747, 1729, 1144, 1008 
cm" 1 . 

Reductive amination of N(BOC) aminoacetaldehyde [Buchardt 
et al. (1993) Org. Prep. Proc . Int. 25:457] is promoted by the 
use of NaBH 3 CN and catalytic acetic acid. t-Butylester and BOC 
groups are removed. The general plan is given in Scheme III. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: INNAPHARMA, INC. 

(ii) TITLE OF INVENTION: Peptides and Pep tidomime tics Inhibiting 
the Oncogenic Action of P21 Ras 

(iii) NUMBER OF SEQUENCES: 52 
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(A) ADDRESSEE: Greenlee, Winner and Sullivan, P.C. 
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(v) COMPUTER READABLE FORM: 
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(C) OPERATING SYSTEM: PC -DOS /MS -DOS 
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(ix) TELECOMMUNICATION INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 

lie Lys Arg Val Lys Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 

Lys Cys Asp Leu Ala 
1 ~ 5 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 

Cys Asp Leu Ala Ala Arg Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Asp Leu Ala Ala 
1 

(2) INFORMATION FOR SEQ ID NO : 5 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Thr Glu Tyr Lys Leu Val Val Val Gly Ala Gly Gly Val Gly Lys 
1 5 10 15 

Ser Ala Leu Thr lie Gin Leu lie Gin Asn His Phe Val Asp Glu Tyr 
20 25 30 

Asp Pro Thr lie Glu Asp Ser Tyr Arg Lys Gin Val Val lie Asp Gly 
3 5 4 0 4 5 

Glu Thr Cys Leu Leu Asp lie Leu Asp Thr Ala Gly Gin Glu Glu Tyr 
5 0 55 60 

Ser Ala Met Arg Asp Gin Tyr Met Arg Thr Gly Glu Gly Phe Leu Cys 

65 70 75 ~ 80 

Val Phe Ala lie Asn Asn Thr Lys Ser Phe Glu Asp lie His Gin Tyr 
85 90 95 

Arg Glu Gin lie Lys Arg Val Lys Asp Ser Asp Asp Val Pro Met Val 
100 105 110 

Leu Val Gly Asn Lys Cys Asp Leu Ala Ala Thr Val Glu Ser Arg Gin 
115 120 125 

Ala Gin Asp Leu Ala Arg Ser Tyr Gly lie Pro Tyr lie Glu Thr Ser 
130 135 140 

Ala Lys Thr Arg Gin Gly Val Glu Asp Ala Phe Tyr Thr Leu Val Arg 
145 150 155 ~ 160 

Glu lie Arg Gin His Lys Leu Arg Lys Leu Asn Pro Pro Asp Glu Ser 
165 170 175 

Gly Pro Gly Cys Met Ser Cys Lys Cys Val Leu Ser 
180 185 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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Thr lie Glu Asp Ser Tyr Arg Lys Gin Val Val lie Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Tvr Arq Glu Gin He Lys Arg Val Lys Asp Ser Asp Asp Val Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Gly Asn Lys Cys Asp Leu Ala Ala Arg Thr Val Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Cys Asp Leu Ala Ala Arg Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 



WO 97/10836 



PCT/US96/15098 



56 

<B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Pro Pro Ala Arg Ala Arg Ala Asp Tyr Asp Tyr Leu lie 
1 5 10 "* 15 

Lys Leu Leu Leu lie Gly Asp Ser Gly Val Gly Lys Ser Cys Leu Leu 
20 25 30 

Leu Arg Phe Ser Asp Gly Ser Phe Thr Thr Ser Phe He Thr Thr He 
35 40 45 

Gly He Asp Phe Lys He Arg Thr He Glu Leu Asp Gly Lys Arg He 
50 55 60 

Lys Leu Gin He Trp Asp Thr Ala Gly Gin Glu Arg Arg Thr He Thr 
65 70 75 " 80 

Thr Ala Tyr Tyr Arg Gly Ala Met Gly He Leu Leu Val Tyr Asp Val 
85 90 95 

Thr Asp Glu Ser Ser Phe Asn Asn He Arg Asn Trp He Arg Asn He 
100 105 no 

Glu Gin His Ala Ser Asp Asn Val Asn Lys He Leu Val Gly Asn Lys 
115 12 0 12 5 

Ala Asp Met Asp Glu Ser Lys Arg Ala Val Pro Thr Ala Lys Gly Gin 
130 135 140 

Ala Leu Ala Asp Glu Tyr Gly He Lys Phe Phe Glu Thr Ser Ala Lys 
145 150 155 160 

Thr Asn Leu Asn Val Glu Glu Val Phe Phe Ser He Gly Arg Asp He 
165 170 175 

Lys Gin Arg Leu Ser Asp Thr Asp Ser Arg Ala Glu Pro Ala Thr He 
180 185 190 

Lys He Ser Gin Thr Asp Gin Ala Ala Gly Ala Gly Gin Ala Thr Gin 
195 200 205 

Lys Ser Ala Cys Cys Gly Thr 
210 215 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Gly Tyr Ala Asp Glu Glu Tyr Asp Tyr Leu Phe Lys Leu Val 
5 10 15 



Leu lie Gly Asp Ser Gly Val Gly Lys Ser Asn Leu Leu Ser Arg Phe 
20 25 30 

Thr Lys Asn Phe Asn Leu Glu Ser Lys Ser Thr He Gly Val Glu Phe 



40 4 5 

Ala Thr Lys Thr Thr Lys Val Glu Gly Lys Val Val Lys Ala Gin He 
50 55 60 

Trp Asp Thr Ala Gly Gin Glu Arg Tyr Arg Ala He Thr Ser Ala Tyr 
65 ^ 70 75 80 

Tvr Ara Gly Ala Val Gly Ala Leu Leu He Tyr Asp Val Thr Arg His 
85 90 95 

Ala Thr Phe Glu Asn Ala Ala Arg Trp Leu Arg Glu Leu Arg Gly His 
100 105 HO 

Thr Asp Pro Asn He Val Val Met Leu He Gly Asn Lys Cys Asp Leu 
115 120 125 

Arg His Leu Val Ala Val Lys Thr Glu Glu Ala Lys Ala Phe Ala Glu 
130 135 140 

Ara Glu Ser Leu Tyr Phe Met Glu Thr Ser Ala Leu Asp Ala Thr Asn 
145 150 155 160 

Val Glu Asn Ala Phe Thr Glu Val Leu Thr Gin He His Lys He Val 
165 170 175 

Ser Lys Arq Ser Val Asp Gly Gly Gly Ser Ala Asp Leu Pro Gly Lys 
= ~ 185 190 



180 



Gly Glu Thr He Asn Val Lys Glu Asp Gly Ser Val Leu Lys Arg Met 
195 200 205 

Gly Cys Cys Ser Asn 
210 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Ser Asp Asp Glu Gly Arg Glu Glu Tyr Phe Lys lie Val Val 
1 5 10 15 

lie Gly Asp Ser Ala Val Gly Lys Ser Asn Leu Leu Ser Arg Tyr Ala 
20 25 30 

Arg Asn Glu Phe Ser Ala Asn Ser Lys Ala Thr lie Gly Val Glu Phe 
35 40 45 

Gin Thr Gin Ser Met He Glu Gly Lys Glu Val Lys Ala Gin He Trp 
50 55 60 

Asp Thr Ala Gly Gin Glu Phe Arg Ala Val Thr Ser Tyr Tyr Arq Gly 
65 70 75 80 

Ala Val Gly Ala Leu Val Val Tyr Asp He Thr Arg Arg Thr Thr Phe 
85 90 95 

Glu Ser Val Gly Arg Trp Leu Asp Glu Leu Lys He His Ser Asp Thr 
100 105 no 

Thr Val Ala Arg Met Leu Val Gly Asn Lys Cys Asp Leu Glu Asn He 
115 120 125 

Arg Ala Val Ser Val Glu Glu Gly Lys Ala Leu Ala Glu Glu Glu Gly 
130 135 ~ 140 

Leu Phe Phe Val Glu Thr Ser Ala Leu Asp Ser Thr Asn Val Lys Thr 
145 150 155 160 

Ala Phe Glu Met Val He Leu Asp He Tyr Asn Asn Val Ser Arg Lys 
165 170 175 

Gin Leu Asn Ser Asp Thr Tyr Lys Asp Glu Leu Thr Val Arg Val Ser 
180 185 190 

Leu Val Lys Asp Asp Asn Ser Ala Ser Lys Gin Ser Ser Gly Phe Ser 
195 200 205 

Cys Cys Ser Ser Thr 
210 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 01 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Discopyge ommata 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asn Pro Glu Tyr Asp Tyr Leu Phe Lys Leu Leu Leu He Gly Asp 
15 10 15 

Ser Gly Val Gly Lys Ser Cys Leu Leu Leu Arg Phe Ala Asp Asp Thr 
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20 25 30 

Tyr Thr Glu Ser Tyr He Ser Thr He Gly Val Asp Phe Lys He Arg 
35 40 45 

Thr He Glu Leu Asp Gly Lys Thr He Lys Leu Gin He Trp Asp Thr 
50 55 60 

Ala Gly Gin Glu Arg Phe Arg Thr He Thr Ser Ser Tyr Tyr Arg Gly 
65 70 75 80 

Ala His Gly He He Val Val Tyr Asp Val Thr Asp Gin Glu Ser Phe 
85 90 95 

Asn Asn Val Lys Gin Trp Leu Gin Glu He Asp Arg Tyr Ala Ser Glu 
100 ' 105 HO 

Asn Val Asn Lys Leu Leu Val Gly Asn Lys Cys Asp Leu Thr Thr Lys 
115 120 125 

Lys Val Val Asp Tyr Thr Thr Lys Glu Phe Ala Asp Ser Leu Gly He 
130 ^ 135 140 

Pro Phe Leu Glu Thr Ser Ala Lys Asn Ala Thr Asn Val Glu Gin Ala 
145 150 155 160 

Phe Met Thr Met Ala Ala Glu He Lys Lys Arg Met Gly Pro Gly Ala 
165 170 175 

Thr Ser Gly Gly Ser Glu Lys Ser Asn Val Asn He Gin Ser Thr Pro 
180 185 190 

Val Lys Ser Ser Gly Gly Gly Cys Cys 
195 200 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 02 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lymnea stagnalis 



(xi) SEQUENCE DESCRIPTION: SEQ II 

Met Ser Thr Met Asn Pro Asp Tyr 
1 5 

He Gly Asp Ser Gly Val Gly Lys 
2 0 

Asp Asp Thr Tyr Thr Glu Ser Tyr 
35 40 

Lys He Arg Thr He Glu Leu Asp 
50 ^ 55 

Trp Asp Thr Ala Gly Gin Glu Arg 



> NO: 14 : 

Asp Tyr Leu Phe Lys Leu Leu Leu 
10 15 

Ser Cys Leu Leu Leu Arg Phe Ala 
25 * 30 

He Ser Thr He Gly Val Asp Phe 
45 

Gly Lys Thr He Lys Leu Gin He 
60 

Phe Arg Thr He Thr Ser Ser Tyr 
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65 70 75 80 

Tyr Arg Gly Ala His Gly lie lie Val Val Tyr Asp Val Thr Asp Gin 
85 90 95 

Glu Ser Phe Asn Asn Val Lys Gin Trp Leu Gin Glu lie Asp Arg Tyr 
100 105 110 

Ala Ser Glu Asn Val Asn Lys Leu Leu Val Gly Asn Lys Ser Asp Leu 
115 120 ' 125 

Thr Thr Lys Lys Val Asp Phe Thr Thr Ala Lys Glu Tyr Ala Asp Gin 
130 135 140 

Leu Gly lie Pro Phe Leu Glu Thr Ser Ala Lys Asn Ala Thr Asn Val 
145 150 155 160 

Glu Gin Ala Phe Met Thr Met Ala Ala Glu lie Lys Asn Arg Met Gly 
165 170 175 

Pro lie Thr Ala Ser Asp Ser Lys Pro Ser Val Lys lie Asn Ser Ser 
180 185 190 

Thr Pro Ser Ala Asn Lys Gly Gly Cys Cys 
195 200 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Ala Tyr Ala Tyr Leu Phe Lys Tyr lie lie lie Gly Asp Thr Gly 
IS 10 15 

Val Gly Lys Ser Cys Leu Leu Leu Gin Phe Thr Asp Lys Arg Phe Gin 
20 25 30 

Pro Val His Asp Leu Thr lie Gly Val Glu Phe Gly Ala Arg Met lie 
35 40 45 

Thr lie Asp Gly Lys Gin lie Lys Leu Gin lie Trp Asp Thr Ala Gly 
50 55 60 

Gin Glu Ser Phe Arg Ser lie Thr Arg Ser Tyr Tyr Arg Gly Ala Ala 
65 70 75 * 80 

Gly Ala Leu Leu Val Tyr Asp lie Thr Arg Arg Asp Thr Phe Asn His 
85 90 ~ 95 

Leu Thr Thr Trp Leu Glu Asp Ala Arg Gin His Ser Asn Ser Asn Met 
100 105 110 

Val lie Met Leu lie Gly Asn Lys Ser Asp Leu Glu Arg Arg Glu Val 
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Lys Lys Glu Glu Gly Glu Ala Phe 
130 135 

Glu Thr Ala Lys Thr Ala Ser Val 
145 " 150 

Lys Glu He Tyr Glu Lys He Gin 
165 

Glu Ala Asn Gly He Lys He Gly 
180 

Thr His Ala Gly Asn Gin Gly Gly 
195 200 



1 



125 

Ala Glu His Gly Leu He Phe Met 
140 

Glu Glu Ala Phe He Asn Thr Ala 
155 160 

Glu Gly Val Phe Asp He Asn Asn 
170 175 

Pro Gin His Ala Ala Thr Asn Ala 
185 190 

Gin Gin Ala Gly Gly Gly Cys Cys 
205 



(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lymnea stagnalis 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

Met Ser Tyr Ala Tyr Leu Phe Lys Tyr He He He Gly Asp Thr Gly 
1 5 io 15 

Val Gly Lys Ser Cys Leu Leu Leu Gin Phe Thr Asp Lys Arg Phe Gin 
20 ' 25 30 

Pro Val His Asp Leu Thr He Gly Val Glu Phe Gly Ala Arg Met He 
35 40 45 

Thr He Asp Gly Lys Gin He Lys Leu Gin He Trp Asp Thr Ala Gly 

5 5 60 



50 



Gin Glu Ser Phe Arg Ser He Thr Arg Ser Tyr Tyr Arg Gly Ala Ala 
65 70 75 80 

Gly Ala Leu Leu Val Tyr Asp He Thr Arg Arg Asp Thr Phe Asn His 
85 90 95 

Leu Thr Thr Trp Leu Glu Asp Ala Arg Gin His Ser Asn Ser Asn Met 
100 105 HO 

Val He Met Leu He Gly Asn Lys Ser Asp Leu Glu Ala Arg Arg Val 
115 120 125 

Lys Lys Glu Glu Gly Glu Ala Phe Arg Glu His Gly Leu He Phe Met 
130 135 140 

Glu Thr Ser Ala Lys Thr Ala Ala Asn Val Glu Glu Ala Phe He Asn 
145 150 155 160 
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Thr Ala Lys Glu lie Tyr Gin Lys lie Gin Asp Gly Val Phe Asp lie 
165 170 175 

Asn Asn Glu Ala Asn Gly lie Lys lie Gly Pro Gin His Ser Pro Ala 
180 185 190 

Ser Gin Ser Leu Asn Val Gly Gly Ser Gly Gly Asn Gin Gly Gly Asn 
195 200 " 205 

Cys Cys 
210 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryctolagus cuniculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala Tyr Ala Tyr Leu Phe Lys Tyr lie lie lie Gly Asp Thr Gly 
1 5 10 15 

Val Gly Lys Ser Cys Leu Leu Leu Gin Phe Thr Asp Lys Arg Phe Gin 
20 25 30 

Pro Val His Asp Leu Thr lie Gly Val Glu Phe Gly Ala Arg Met lie 

35 40 45 

Thr lie Asp Gly Lys Gin lie Lys Leu Gin lie Trp Asp Thr Ala Gin 
50 55 60 

Glu Ser Phe Arg Ser lie Arg Ser Tyr Tyr Arg Gly Ala Gly Ala Leu 
65 70 - 75 ^ " 80 

Leu Val Tyr Asp lie Thr Arg Arg Asp Thr Phe Asn His Leu Thr Thr 
85 90 95 

Trp Leu Glu Asp Ala Arg Gin His Ser Asn Ser Asn Met Val lie Met 
100 105 no 

Leu lie Gly Asn Lys Ser Asp Leu Glu Ser Arg Arg Glu Val Lys Lys 

115 120 125 

Glu Glu Gly Glu Ala Phe Ala Arg Glu His Gly Leu lie Phe Met Glu 
130 135 140 

Thr Ser Ala Lys Thr Ala Ser Asn Val Glu Glu Ala Phe lie Asn Thr 
145 150 155 160 

Ala Lys Glu lie Tyr Glu Lys He Gin Glu Gly Val Phe Asp He Asn 
165 170 175 

Asn Glu Ala Asn Gly He Lys He Gly Pro Gin His Gly Ala Thr Asn 
180 185 190 



i 

r 
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Ala His Ala Gly Asn Gin Gly Gly Gin Gin Ala Gly Gly Gly Cys Cys 
X95 J 200 205 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus norvegicus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Met Ala Tyr Ala Tyr Leu Phe Lys Tyr He He He Gly Asp Thr Gly 
1 5 10 15 

Val Gly Lys Ser Cys Leu Leu Leu Gin Phe Thr Asp Lys Arg Phe Gin 
20 J 25 30 

Pro Val His Asp Leu Thr Met Gly Val Glu Phe Gly Ala Arg Met He 
35 40 45 

Thr He Asp Gly Lys Gin He Lys Leu Gin He Trp Asp Thr Ala Gly 
50 * 55 60 

Gin Glu Ser Phe Arg Ser He Thr Arg Ser Tyr Tyr Arg Gly Ala Ala 
65 70 75 80 

Glv Ala Leu Leu Val Tyr Asp He Thr Arg Arg Asp Thr Phe Asn His 
85 90 95 

Leu Thr Thr Trp Leu Glu Asp Ala Arg Gin His Ser Asn Ser Asn Met 
100 *" 105 HO 

Val He Met Leu He Gly Asn Lys Ser Asp Leu Glu Ser Arg Arg Glu 

H5 120 125 

Val Lys Lys Glu Glu Gly Glu Ala Phe Ala Arg Glu His Gly Leu He 
130 135 140 

Phe Met Glu Thr Ser Ala Lys Thr Ala Ser Asn Val Glu Glu Ala Phe 
145 150 155 160 

He Asn Thr Ala Lys Glu He Tyr Glu Lys He Gin Glu Gly Val Phe 
165 170 175 

Asp He Asn Asn Glu Ala Asn Gly He Lys He Gly Pro Gin His Ala 
180 185 190 

Ala Thr Asn Ala Ser His Gly Gly Asn Gin Gly Gly Gin Gin Ala Gly 
195 200 205 

Gly Gly Cys Cys 
210 

(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Drosophila melanogaster 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ala Gly Gly Asp Pro Lys Trp Gin Lys Asp Ala Ala Asp Gin Asn 
15 10 15 

Phe Asp Tyr Met Phe Lys Leu Leu lie lie Gly Asn Ser Ser Val Gly 
20 2 5 3 0 

Lys Thr Ser Phe Leu Phe Arg Tyr Ala Asp Asp Ser Phe Thr Ser Ala 
35 40 45 

Phe Val Ser Thr Val Gly lie Asp Phe Lys Val Lys Thr Val Phe Arg 
50 55 60 

His Asp Lys Arg Val Lys Leu Gin He Trp Asp Thr Ala Gly Gin Glu 
65 70 75 80 

Arg Tyr Arg Thr He Thr Thr Ala Tyr Tyr Arg Gly Ala Met Gly Phe 
85 90 95 

He Leu Met Tyr Asp Val Thr Asn Glu Asp Ser Phe Asn Ser Val Gin 
100 105 HO 

Asp Trp Val Thr Gin He Lys Thr Tyr Ser Trp Asp Asn Ala Gin Val 
115 120 125 

He Leu Val Gly Asn Lys Cys Asp Met Glu Asp Gin Arg Val He Ser 
130 135 140 

Phe Glu Arg Gly Arg Gin Leu Ala Asp Gin Leu Gly Val Glu Phe Phe 
145 150 155 160 

Glu Thr Ser Ala Lys Glu Asn Val Asn Val Lys Ala Val Phe Glu Arg 
165 170 ' 175 

Leu Val Asp He He Cys Lys Met Ser Glu Ser Leu Asp Ala Asp Pro 
180 185 190 

Thr Leu Val Gly Gly Gly Gin Lys Gly Gin Arg Leu Thr Asp Gin Pro 
195 200 " 205 

Gin Gly Thr Pro Asn Ala Asn Cys Asn Cys 
210 215 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 08 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus norvegicus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ser Glu Thr Tyr Asp Phe Leu Lys Phe Leu Val lie Gly Asn Ala 
! 5 10 15 

Gly Thr Gly Lys Ser Cys Leu Leu His Gin Phe lie Glu Lys Lys Phe 
20 * 25 30 

Lvs Asp Asp Ser Asn His Thr He Gly Val Glu Phe Gly Gin Lys He 
35 40 45 

He Asn Val Gly Gly Lys Tyr Val Lys Leu Gin He Trp Asp Thr Ala 
50 55 60 

Gly Gin Glu Arg Phe Arg Val Thr Thr Ser Tyr Arg Gly Ala Ala Gly 
65 " 70 75 80 

Ala Leu Leu Val Tyr Asp He Thr Ser Arg Glu Thr Tyr Asn Ala Leu 
85 90 95 

Thr Asn Trp Leu Thr Asp Ala Arg Met Leu Ala Ser Gin Asn He Val 
100 . 105 HO 

He Cys Gly Asn Lys Lys Asp Leu Asp Ala Asp Arg Glu Val Thr Phe 
115 120 125 

Leu Glu Ala Ser Arg Phe Ala Gin Glu Asn Glu Leu Met Phe Leu Glu 
130 " 135 140 

Thr Ser Ala Leu Thr Gly Glu Asn Val Glu Glu Ala Phe Met Gin Cys 
145 150 155 160 

Ala Arg Lys He Leu Asn Lys He Glu Ser Gly Glu Leu Asp Pro Glu 
165 170 175 

Arg Met Gly Ser Gly He Gin Tyr Gly Asp Ala Ala Leu Arg Gin Leu 
180 185 190 

Arg Ser Pro Arg Arg Thr Gin Ala Pro Ser Ala Gin Glu Cys Gly Cys 
195 ~ 200 205 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 03 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Caenorhabditis elegans 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Met Ala Asp Phe Thr Asn Asn Ala Leu Lys Lys Phe Lys Leu Val Phe 
1 5 10 15 

Leu Gly Glu Gin Ser Val Gly Lys Thr Ser lie lie Thr Arg Phe Met 
20 25 30 

Tyr Asp Ser Phe Asp Asn Thr Tyr Gin Ala Thr lie Gly lie Asp Phe 
35 40 45 

Leu Ser Lys Thr Met Tyr Leu Glu Asp Arg Thr lie Arg Leu Gin Leu 
50 55 60 

Trp Asp Thr Ala Gly Gin Glu Arg Phe Arg Ser Leu lie Pro Ser Tyr 
65 ^ 70 75 80 

lie Arg Asp Ser Ser Val Ala Val Val Val Tyr Asp lie Thr Asn Ala 
85 90 95 

Asn Ser Phe His Gin Thr Thr Lys Trp Val Asp Asp Val Arg Asn Glu 
100 105 110 

Arg Gly Cys Asp Val lie lie Val Leu Val Gly Asn Lys Thr Asp Leu 
115 120 125 

Ala Asp Lys Arg Gin Val Ser Thr Glu Asp Gly Glu Lys Lys Ala Arg 
130 135 140 

Asp Leu Asn Val Met Phe lie Glu Thr Ser Ala Lys Ala Gly Tyr Asn 
145 150 155 160 

Val Lys Gin Leu Phe Arg Lys lie Ala Leu Pro Gly lie Val Gin Glu 
165 170 175 

Glu Thr Pro Glu Gin Pro Asn lie Val lie Met Asn Pro Pro Lys Asp 
180 185 190 

Ala Glu Glu Ser Gin Gly Arg Gin Cys Pro Cys 
195 200 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Ser Thr Gly Gly Asp Phe Gly Asn Pro Leu Arg Lys Phe Lys Leu 
15 10 15 

Val Phe Leu Gly Glu Gin Ser Val Gly Lys Thr Ser Leu lie Thr Arg 
20 25 30 

Phe Met Tyr Asp Ser Phe Asp Asn Thr Tyr Gin Ala Thr lie Gly lie 



i 

! 
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35 40 45 

Asp Phe Leu Ser Lys Thr Met Tyr Leu Glu Asp Arg Thr Val Arg Leu 
50 55 60 

Gin Leu Trp Asp Thr Ala Gly Gin Glu Arg Phe Arg Ser Leu He Pro 
65 " 70 75 80 

Ser Tyr He Arg Asp Ser Thr Val Ala Val Val Val Tyr Asp He Thr 
85 9° 95 

Asn Val Asn Ser Phe Gin Gin Thr Thr Lys Trp He Asp Asp Val Arg 
100 105 HO 

Thr Glu Arg Gly Ser Asp Val He He Met Leu Val Gly Asn Lys Thr 
U 5 " 120 125 

Asp Leu Ala Asp Lys Arg Gin Val Ser He Glu Glu Gly Glu Arg Lys 



130 



Ala Lys Glu Leu Asn Val Met Phe He Glu Ser Ala Lys Ala Gly Tyr 

150 155 160 



145 



Asn Val Lys Gin Leu Phe Arg Arg Val Ala Ala Ala Leu Pro Gly Met 
165 170 175 

Glu Ser Thr Gin Asp Arg Ser Arg Glu Asp Met He Asp He Lys Leu 
180 185 190 

Glu Lys Pro Gin Glu Gin Pro Val Ser Glu Gly Gly Cys Ser Cys 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Canis familiaris 



(xi) SEQUENCE DESCRIPTION: SEQ IE 

Met Thr Ser Arg Lys Lys Val Leu 
1 5 

Ser Gly Val Gly Lys Thr Ser Leu 
2 0 

Phe Ser Asn Gin Tyr Lys Ala Thr 
35 40 

Glu Val Met Val Asp Asp Arg Leu 
50 55 

Ala Gly Gin Glu Arg Phe Gin Ser 
65 70 

Asp Cys Cys Val Leu Val Phe Asp 



' NO: 23 : 

Leu Lys Val He He Leu Gly Asp 
10 15 

Met Asn Gin Tyr Val Asn Lys Lys 
25 30 

He Gly Ala Asp Phe Leu Thr Lys 
45 

Val Thr Met Gin He Trp Asp Thr 
60 

Leu Gly Val Phe Tyr Arg Gly Ala 
75 80 

Val Thr Ala Pro Asn Thr Phe Lys 
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85 90 95 

Thr Leu Asp Ser Trp Arg Asp Glu Phe Leu lie Gin Ala Ser Pro Arg 
100 105 110 

Asp Pro Glu Asn Phe Pro Phe Val Val Leu Gly Asn Lys lie Asp Leu 
115 120 125 

Glu Asn Arg Gin Val Ala Thr Lys Arg Ala Gin Ala Trp Cys Tyr Ser 
130 135 140 

Lys Asn Asn lie Pro Tyr Phe Glu Thr Ser Ala Lys Glu Ala lie Asn 
145 150 155 160 

Val Glu Gin Ala Phe Gin Thr lie Ala Arg Asn Ala Leu Lys Gin Glu 
165 170 175 

Thr Glu Val Glu Leu Tyr Asn Glu Phe Pro Glu Pro lie Lys Leu Asp 
180 185 190 

Lys Asp Ala Lys Thr Ser Ala Glu Cys Ser Cys 
195 2 00 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 202 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostel ium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Thr Lys Lys Lys Val Leu Leu Lys Val lie lie Leu Gly Asp Ser 
1 5 10 15 

Gly Val Gly Lys Thr Ser Leu Met Asn Gin Tyr Val Asn Lys Lys Phe 
20 25 30 

Ser Asn Gin Tyr Lys Ala Thr He Gly Ala Asp Phe Leu Thr Lys Glu 
35 40 45 

Leu Met Val Asp Asp Arg Val Val Thr Met Gin He Trp Asp Thr Ala 
50 55 60 

Gly Gin Glu Arg Phe Gin Ser Leu Gly Val Ala Phe Tyr Arg Gly Ala 
65 70 75 80 

Asp Cys Cys Val Leu Cys Tyr Asp Val Asn Val Ala Lys Thr Phe Glu 
85 90 95 

Asn Leu Asp Ser Trp Arg Asp Glu Phe Leu He Gin Ala Gly Pro Arg 
100 105 no 

Asp Pro Asp Asn Phe Pro Phe Val Val Leu Gly Asn Lys lie Asp Leu 
115 120 125 

Glu Asn Gin Arg Val Val Ser Gin Lys Arg Ala Ala Ser Trp Cys Gin 
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130 135 140 



Ser Lys Gly Asn He Pro Tyr Phe Glu Thr Ser Ala Lys Glu Ala lie 
145 150 155 lbU 

Asn Val Glu Gin Ala Phe Gin Thr He Ala Arg Asn Ala He Lys Leu 



165 



Glu Asp Gly Leu Val Phe Pro He Pro Thr Asn He Gin Val He Pro 
180 185 190 

Glu Pro Gin Pro Ala Lys Ser Gly Cys Cys 
!95 200 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Canis familiaris 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Met Lys Thr Tyr Asp Tyr Leu Phe Lys Leu Leu Leu He Gly Asp Ser 
1 5 10 I 5 

Gly Val Gly Lys Thr Cys Val Leu Phe Arg Phe Ser Glu Asp Ala Phe 
20 25 30 

Asn Ser Thr Phe He Ser Thr He Gly He Asp Phe Lys He Arg Thr 
35 40 45 

He Glu Leu Asp Gly Lys Arg He Lys Leu Gin He Trp Asp Thr Ala 
50 55 60 

Gly Gin Glu Arg Phe Arg Thr He Thr Thr Ala Tyr Tyr Arg Ala Met 
6 5 70 75 

Gly He Met Leu Val Tyr Asp He Thr Asn Glu Lys Ser Phe Asp Asn 
85 90 95 

He Arg Asn Trp He Arg Asn He Glu Glu His Ala Ser Ala Asp Val 
!00 1°5 11° 

Glu Lys Met He Leu Gly Asn Lys Cys Asp Val Asn Asp Lys Arg Gin 
115 120 125 

Val Ser Lys Glu Arg Gly Glu Lys Leu Ala Leu Asp Tyr Gly He Lys 
130 ~ 135 140 

Phe Met Glu Thr Ser Ala Lys Ala Asn He Asn Val Glu Asn Ala Phe 
145 150 155 160 

Phe Thr Leu Ala Arg Asp He Lys Ala Lys Met Asp Lys Lys Leu Glu 
165 170 175 

Gly Asn Ser Pro Gin Gly Ser Asn Gin Gly Val Lys He Thr Pro Asp 
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180 185 190 

Gin Gin Lys Arg Ser Ser Phe Phe Arg Cys Val Leu Leu 
195 200 ~ 205 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Glu Glu Glu lie Leu Tyr Lys lie lie Leu Val Gly Glu Ser Gly 
1 5 10 15 

Val Gly Lys Ser Ser lie Leu Val Arg Phe Thr Asp Asn Thr Phe Ser 
20 25 30 

Gin His Phe Ala Pro Thr Leu Gly Val Phe Val Lys Thr lie Arg Asn 
35 40 45 

Lys Glu Thr Gly Gin Thr Val Lys Leu Gin Leu Trp Asp Thr Ala Gly 
50 55 60 

Gin Glu Arg Phe Lys Ser lie Thr Gin Phe Tyr Arg Gly Ser His Gly 
65 70 75 " 80 

Val lie Val Val Tyr Asp Val Thr Asp Pro Lys Ser Phe Glu Arg Leu 
85 90 95 

Lys Asn Trp Val Glu Asp lie Asn Gin Tyr Thr Gin Asp Gly Met lie 
100 105 ^ 110 

lie lie Leu Val Gly Asn Lys Ser Asp Met Val Ala Gin Arg Lys Val 
115 12 0 12 5 

Thr Phe Glu Gin Gly Gin Glu Met Ala Glu Gin Leu Lys Thr Lys Phe 
130 135 140 

Leu Glu Val Ser Ala Lys Glu Asn Asn Gly Val Thr Gin Val Phe Asp 
145 150 155 160 

Leu Leu Val Gin Asp lie Glu Ala Thr Met Lys Asn Ser Lys Val Ala 
165 170 ~ 175 

Gin Asn Gin Leu Asn Leu Ser Val Gly Gin Glu Arg Gly Cys Cys 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Caenorhabditis elegans 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Met 

1 


Gin 


Ala 


He 


Lys 
5 


Cys 


Val 


Val 


Val 


Gly Asp 
10 


Gly Ala 


Val 


Gly 
15 


Lys 


Thr 


Cys 


Leu 


Leu 
20 


He 


Ser 


Tyr 


Thr 


Thr 
25 


Asn 


Ala 


Phe 


Pro 


Gly 
30 


Glu 


Tyr 


He 


Pro 


Thr 
35 


Val 


Phe 


Asp 


Asn 


Tyr 
40 


Ser 


Ala 


Asn 


Val 


Met 
45 


Val 


Asp 


Gly 


Arg 


Pro 
50 


He 


Asn 


Leu 


Gly 


Leu 
55 


Trp 


Asp 


Thr 


Ala 


Gly 
60 


Gin 


Asp 


Tyr 


Asp 


Arg 
65 


Leu 


Arg 


Pro 


Leu 


Ser 
70 


Tyr 


Pro 


Gin 


Thr 


Asp 

75 


Val 


Phe 


Leu 


Val 


Cys 
80 


Phe 


Ala 


Leu 


Asn 


Asn 
85 


Pro 


Ala 


Ser 


Phe 


Glu 
90 


Asn 


Val 


Arg 


Ala 


Lys 
95 


Trp 


Tyr 


Pro 


Glu 


Val 
100 


Ser 


His 


His 


Cys 


Pro 
105 


Asn 


Thr 


Pro 


He 


He 
110 


Leu 


Val 


Gly 


Thr 


Lys 
115 


Ala 


Asp 


Leu 


Arg 


Glu 
120 


Asp 


Asp 


Thr 


Val 


Glu 
125 


Arg 


Leu 


Arg 


Glu 


Arg 
130 


Arg 


Leu 


Gin 


Pro 


Val 
135 


Ser 


Gin 


Thr 


Gin 


Gly Tyr 
140 


Val 


Met 


Ala 


Lys 
145 


Glu 


He 


Lys 


Ala 


Val 
150 


Lys 


Tyr 


Leu 


Glu 


Cys 
155 


Ser 


Ala 


Leu 


Thr 


Gin 
160 


Arg 


Gly 


Leu 


Lys 


Gin 
165 


Val 


Phe 


Asp 


Glu 


Ala 
170 


He 


Arg 


Ala 


Val 


Val 
175 


Thr 


Pro 


Pro 


Gin 


Arg 
180 


Ala 


Lys 


Lys 


Ser 


Lys 
185 


Cys 


Thr 


Val 


Leu 









(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
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15 10 15 

Thr Cys Leu Leu lie Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 
20 25 30 

lie Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Gly 
35 40 45 

Lys Pro lie Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 
50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu lie 
65 70 75 80 

Cys Phe Ser lie lie Ser Pro Ser Ser Phe Glu Asn Val Asn Gly Lys 
85 90 95 

Trp His Pro Glu lie Cys His His Pro Asn Val Pro lie Leu Val Gly 
100 105 110 

Thr Lys Leu Asp Met Arg Asp Lys Glu Thr Gin Asp Arg Leu Lys Glu 
115 120 125 

Lys Lys Leu Tyr Pro lie Ser Tyr Glu Gin Gly Leu Ala Lys Met Lys 
13 0 13 5 14 0 

Glu lie Asn Ala Val Lys Tyr Leu Glu Cys Ser Ala Leu Thr Glu Lys 
145 150 155 160 

Gly Leu Lys Thr Val Phe Asp Glu Ala lie Arg Ala Val lie Asn Pro 
165 . 170 175 

Pro Leu Ser Lys Lys Lys Lys Ser Ser Gly Gly Cys Asn lie Leu 
180 185 ' ^ - 190 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostel ium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gin Ser lie Lys Leu Val Val Val Gly Asp Gly Ala Val Gly Lys 

15 10 15 

Thr Cys Leu Leu lie Ser Tyr Thr Ser Asn Ser Phe Pro Thr Glu Tyr 
20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Asn 
3 5 4 0 4 5 

Lys Thr Val Ser Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 
50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu lie 
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Cys Phe Ala lie 



Trp Pro Glu Val 
100 



Thr Lys Cys Asp 
115 

Lys His Gin Gin 
130 

Glu lie Lys Ala 
145 

Gly Leu Lys Gin 



Asp Arg Asp Lys 
180 



70 

lie Ser Gin Ser 
85 

Thr His His Cys 



Leu Arg Asp Lys 
120 

Pro Leu Thr Phe 
135 

Phe Cys Tyr Met 
150 

Val Phe Asp Glu 
165 

Ala Thr Asn Lys 



73 



75 

Tyr Thr Asn Val 
90 

Pro Asn Cys Thr 
105 

Glu Ser Leu Glu 



Gin Gin Gly Glu 
140 

Glu Cys Ser Ala 
155 

Ala lie Lys Ala 
170 

Lys Asn Ser Lys 
185 



80 

Lys Ser Lys Trp 
95 

He Leu Val Gly 
110 

Lys Leu Arg Glu 
125 

Gin Met Ala Lys 



Leu Thr Gin Lys 
160 

Val He Phe Pro 
175 

Cys Ser He Leu 
190 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Ser Ala Ala Glu Val He Lys Leu Val Val He Gly Gly Ala Val 
1 5 10 15 

Gly Lys Thr Cys Leu Leu He Tyr Ala Asn Asn Arg Phe Pro Glu Asp 
20 25 30 

Tyr He Pro Thr Val Phe Asp Asn Tyr Val Val Asn Leu Thr Ala Gly 
"35 4 0 4 5 

Asp Arg Asn He Glu Leu Gly Leu Trp Asp Thr Ala Gly Glu Tyr Asp 
50 55 60 

Lys Leu Arg Pro Leu Ser Tyr Ala Asn Asn Val Phe Leu He Cys Phe 
65 ~ 70 75 80 

Ser He Asn Pro Val Ser Phe Glu Asn Val Tyr Thr Lys Trp Tyr Pro 
85 90 95 

Glu Val Met His Phe Cys Pro Glu Val Gin He Leu Val Gly Thr Lys 
100 ^ 105 110 

Leu Asp Thr Arg Asp Asp Arg Gly Val Leu Asp Lys Leu Gin Gin Thr 
115 ~ ' 120 125 
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Gly His Lys Pro lie Thr Thr Glu 
130 135 

lie Lys Ala lie Lys Tyr Met Glu 
145 150 

Leu Lys Gin Val Phe Asp Glu Ala 
16 5 

Lys Lys Lys Ser Lys Cys lie Val 
180 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Homo sapiens 



74 



Gin Gly Asn Asp Leu Ala Arg Arg 
14 0 

Cys Ser Ala Lys Thr Ser Gin Asn 
155 160 

lie Lys Ser Val Leu Phe lie Lys 
170 175 

Met 
185 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Met Ala Ala Asn Lys Pro Lys Gly Gin Asn Ser Leu Ala Leu His Lys 
15 10 is 

Val He Met Val Gly Ser Gly Gly Val Gly Lys Ser Ala Leu Thr Leu 
20 25 30 

Gin Phe Met Tyr Asp Glu Phe Val Glu Asp Tyr Glu Pro Thr Lys Ala 
35 40 45 

Asp Ser Tyr Arg Lys Lys Val Val Leu Asp Gly Glu Glu Val Gin He 
50 5 5 6 0 

Asp He Leu Asp Thr Ala Gly Gin Glu Asp Tyr Ala Ala He Arg Asp 
65 70 75 * eo 

Asn Tyr Phe Arg Ser Gly Glu Gly Phe Leu Cys Val Phe Ser He Thr 
85 90 95 

Glu Met Glu Ser Phe Ala Ala Thr Ala Asp Phe Arg Glu Gin He Leu 
100 105 ~ no 

Arg Val Lys Glu Asp Glu Asn Val Pro Phe Leu Leu Val Gly Asn Lvs 
H5 120 125 

Ser Asp Leu Glu Asp Lys Arg Gin Val Ser Val Glu Glu Ala Lys Asn 
130 135 140 

Arg Ala Glu Gin Trp Asn Val Asn Tyr Val Glu Thr Ser Ala Lvs Thr 
145 "0 155 7 i 6 o 

Arg Ala Asn Val Asp Lys Val Phe Phe Asp Leu Met Arg Glu He Arg 
165 170 ~ 175 

Ala Arg Lys Met Glu Asp Ser Lys Lys Asn Gly Lys Lys Lys Arg Lys 
180 185 ' 190 
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Ser Leu Ala Lys Arg lie Arg Glu Arg Cys Cys lie Leu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Ala Ala Asn Lys Ser Lys Gly Gin Ser Ser Leu Ala Leu His Lys 
1 5 10 15 

Val lie Met Val Gly Ser Gly Gly Val Gly Lys Ser Ala Leu Thr Leu 
20 ' 25 30 

Gin Phe Met Tyr Asp Glu Phe Val Glu Asp Tyr Glu Pro Thr Lys Ala 
35 * ^ 40 45 

Asp Ser Tyr Arg Lys Lys Val Val Leu Asp Gly Glu Glu Val lie Asp 
50 ^ 55 60 

lie Leu Asp Thr Ala Gly Gin Glu Asp Tyr Ala lie Arg Asp Asn Tyr 
65 " 70 75 80 

Phe Arg Ser Gly Glu Gly Phe Leu Leu Val Phe Ser lie Thr Glu His 
85 90 95 

Glu Ser Phe Thr Ala Thr Ala Glu Phe Arg Glu Gin lie Leu Arg Val 
100 105 110 

Lys Ala Glu Glu Asp Lys lie Pro Leu Leu Val Val Gly Asn Lys Ser 
115 ~ " 120 125 

Asp Leu Glu Glu Arg Arg Gin Val Pro Val Glu Glu Ala Arg Ser Lys 
130 135 140 

Ala Glu Glu Trp Gly Val Gin Tyr Val Glu Thr Ser Ala Lys Thr Arg 
145 150 155 160 

Ala Asn Val Asp Lys Val Phe Phe Asp Leu Met Arg Glu He Arg Thr 
165 170 175 

Lys Lys Met Ser Glu Asn Lys Asp Lys Asn Gly Lys Lys Ser Ser Lys 
180 185 190 

Asn Lys Lys Ser Phe Lys Glu Arg Cys Cys Leu Leu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 00 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Discopyge ommata 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Ala Ala Asn Lys Asn Lys Asn Gin Ser Ser Leu Leu Lys Val lie 
1 5 10 



15 



Met Val Gly Ser Gly Gly Val Gly Lys Ser Ala Leu Thr Leu Gin Phe 
20 25 30 

Met Tyr Asp Glu Phe Val Glu Asp Tyr Glu Pro Thr Lys Ala Asp Ser 
35 40 45 

Tyr Arg Lys Lys Val Val Leu Asp Gly Glu Val Gin He Asp He Leu 
o U 5 5 



60 



Asp Thr Ala Gly Gin Glu Asp Tyr Ala He Arg Asp Asn Tyr Phe Arg 

65 70 75 " 

Ser Gly Glu Gly Phe Leu Cys Val Phe Ser He Glu Gin Glu Ser Phe 
85 90 95 

Thr Ala Thr Val Glu Phe Arg Glu Gin He Leu Arg Val Lys Glu Glu 
100 105 no 

Asp Lys He Pro Leu Leu Leu Val Gly Asn Lys Ser Asp Leu Glu Asp 
115 120 " 125 

Arg Arg Gin Val Ser He Glu Glu Ala Arg Ser Lys Ala Glu Glu Trp 
13 0 135 140 

Gly Val Gin Tyr Val Glu Thr Ser Ala Lys Thr Arg Ala Asn Val Asp 
145 150 155 i 6 o 

Lys Val Phe Phe Asp Leu Met Arg Glu Val Arg Ala Lys Lys Met Ser 
165 170 * i 7 5 

Glu Asn Lys Asp Lys Asn Gly Lys Lys Ser Ser Arg Asn Lys Lys Ser 
180 185 J 190 

Leu Arg Glu Arg Cys Cys He Leu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Discopyge ommata 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 34 : 

Met Ala Lys Lys Thr Tyr Asp Leu Leu Phe Lys Leu Leu Leu lie Gly 
15 10 15 

Asp Ser Gly Val Gly Lys Thr Cys Val Leu Phe Arg Phe Ser Asp Asp 
20 25 30 

Ala Phe Asn Thr Thr Phe lie Ser Thr lie Gly lie Asp Phe Lys lie 
35 40 45 

Lys Thr Val Glu Leu His Gly Lys Lys lie Lys Leu Gin lie Trp Asp 
50 55 60 

Thr Ala Gly Gin Glu Arg Phe His Thr lie Thr Ser Tyr Tyr Arg Gly 
65 70 75 80 

Ala Met Gly lie Met Leu Val Tyr Asp lie Thr Asn Ala Lys Ser Phe 
85 90 95 

Glu Asn lie Ser Lys Trp Leu Arg Asn lie Asp Glu His Ala Asn Glu 
100 105 110 

Asp Val Glu Arg Met Leu Leu Gly Asn Lys Asp Met Glu Asp Lys Arg 
115 120 125 

Val Val Leu Lys Ser Lys Gly Gin lie Ala Glu His Ala lie Arg Phe 
130 135 140 

Phe Glu Thr Ser Ala Lys Ala Asn lie Asn lie Glu Lys Ala Phe Leu 
145 150 155 160 

Thr Leu Ala Glu Asp lie Leu Gin Lys Thr Pro Val Lys Glu Pro Asp 
165 170 175 

Arg Glu Asn Val Asp lie Ser Thr Gly Gly Gly Gly Leu Lys Lys Cys 
180 185 * 190 

Cys Ser 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Discopyge ommata 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Lys Thr Tyr Asp Tyr Leu Phe Lys Leu Leu Leu lie Gly Asp Ser 
15 10 15 

Gly Val Gly Lys Thr Cys Leu Leu Phe Arg Phe Ser Glu Asp Ala Phe 
20 25 30 
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Asn Thr Thr Phe lie Ser Thr lie Gly lie Asp Phe Lys lie Arg Thr 
35 40 45 

Val Glu Leu Asp Gly Lys Lys lie Lys Leu Gin lie Trp Asp Thr Ala 
50 55 60 

Gly Gin Glu Arg Phe Arg Thr lie Thr Ala Tyr Tyr Arg Gly Ala Met 
65 70 75 80 

Gly lie Met Lys Val Asp lie Thr Asn Glu Lys Ser Phe Asp Asn lie 
85 90 95 

Lys Asn Trp lie Arg Asn lie Glu Glu His Ala Ser Ser Asp Val Glu 
100 105 no 

Arg Met lie Leu Gly Asn Lys Cys Asp Met Asn Glu Lys Arg Gin Val 
115 120 125 

Ser Lys Glu Arg Gly Glu Lys Leu Ala lie Asp Tyr Gly lie Lys Phe 
130 135 140 

Leu Glu Thr Ser Ala Lys Ser Ser lie Asn Val Glu Glu Ala Phe lie 
145 150 155 160 

Thr Leu Ala Arg Asp lie Met Thr Lys Leu Asn Lys Lys Met Asn Glu 
165 170 " 175 

Asn Ser Leu Gin Glu Ala Val Asp Lys Leu Lys Ser Pro Pro Lys Lys 
18 0 18 5 190 

Pro Ser Gin Lys Lys Lys Gin Leu Ser Phe Arg Cys Ser Leu Leu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Discopyge ommata 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Gly Thr Arg Asp Asp Glu Tyr Asp Tyr Leu Phe Lys Val Val Leu 
1 5 10 15 

lie Gly Asp Ser Gly Val Gly Lys Ser Asn Leu Leu Ser Arg Phe Thr 
20 25 30 

Arg Glu Phe Asn Leu Glu Ser Lys Ser Thr lie Gly Val Glu Phe Ala 
35 40 45 

Thr Arg Ser lie Gin Val Asp Gly Lys Thr lie Lys Gin lie Trp Asp 
50 55 60 

Thr Gly Gin Glu Arg Tyr Arg Ala lie Thr Ser Ala Tyr Tyr Arg Gly 
65 70 75 80 
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Ala Val Gly Ala Leu Leu Val Tyr Asp He Ala Lys His Leu Thr Tyr 
85 90 95 

Glu Asn Val Glu Arg Trp Leu Lys Glu Leu Arg Asp His Ala Asp Asn 
100 105 110 

Asn He Val He Met Leu Val Gly Asn Lys Ser Asp Leu Arg His Leu 
115 120 125 

Arg Val Pro Thr Asp Ala Arg Ala Phe Ala Glu Lys Asn Asn Leu Ser 
130 135 140 

Phe He Glu Thr Ser Ala Leu Asp Ser Thr Asn Val Glu Glu Ala Phe 
145 150 155 160 

Lys Asn He Leu Thr Glu lie Tyr Arg He Val Ser Gin Lys Gin He 
165 170 175 

Ser Asp Arg Ser Ala His Asp Glu Ser Pro Gly Asn Asn Val Val Asp 
180 185 190 

He Ser Val Pro Pro Thr Thr Asp Gly Gin Lys Ser Asn Lys Leu Gin 
195 200 205 

Cys Cys Gin Asn Met 
210 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 



(xi) SEQUENCE DEi 

Met Pro Leu Arg 
1 

Lys Ser Ala Leu 
20 

Tyr Asp Pro Thr 
35 

Ser Asn Gin Cys 
50 

Phe Thr Met Arg 

65 

Val Tyr Ser He 



Arg Glu Gin He 
100 



CRIPTION: SEQ I] 

Phe Lys He Val 
5 

Thr Val Gin Phe 



He Glu Asp Ser 
40 

Met Leu Glu He 
55 

Asp Leu Tyr Met 
70 

He Ser Asn Ser 
85 

Leu Arg Val Lys 



) NO: 37 : 

Val Leu Gly Ser 
10 

Val Gin Gly He 
25 

Tyr Arg Lys Gin 



Leu Asp Thr Ala 

60 

Lys Asn Gly Gin 
75 

Thr Phe Asn Glu 
90 

Asp Cys Glu Asp 
105 



Gly Gly Val Gly 
15 

Phe Val Glu Lys 
30 

Val Glu Val Asp 
45 

Gly Thr Glu Gin 



Gly Phe Val Leu 

80 

Leu Pro Asp Leu 

95 

Val Pro Met Val 
110 
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Leu Val Gly Asn Lys Cys Asp Leu His Asp Gin Arg Val lie Ser Thr 
115 120 125 

Glu Gin Gly Glu Glu Leu Ala Arg Lys Phe Gly Asp Cys Tyr Phe Leu 
130 135 140 

Glu Ala Ser Ala Lys Asn Lys Val Asn Val Glu Gin He Phe Tyr Asn 
145 I 50 155 160 

Leu He Arg Gin He Asn Arg Lys Asn Pro Val Gly Pro Pro Ser Lys 
165 170 " 175 

Ala Lys Ser Lys Cys Ala Leu Leu 
180 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Arg Glu Tyr Lys Val Val Val Leu Gly Ser Gly Gly Val Gly Lys 
1 5 io " " 15 

Ser Ala Leu Thr Val Gin Phe Val Thr Gly Thr Phe He Glu Lys Tvr 
20 25 30 

Asp Pro Thr He Glu Asp Phe Tyr Arg Lys Glu He Glu Val Asp Ser 
35 40 45 

Ser Pro Ser Val Leu Glu He Leu Asp Thr Ala Gly Thr Glu Gin Phe 
50 55 60 

Ala Ser Arg Asp Leu Tyr He Lys Asn Gly Gin Gly Phe He Leu Val 
65 70 75 80 

Tyr Ser Leu Val Asn Gin Gin Phe Gin Asp He Lys Pro Met Arcr Asd 
85 90 95 

Gin He He Arg Val Lys Tyr Glu Lys Val Pro Val He Leu Val Gly 
100 105 no 

Asn Lys Val Asp Leu Glu Ser Glu Arg Glu Val Ser Ser Ser Glu Gly 
115 120 125 

Arg Ala Leu Ala Glu Glu Trp Gly Cys Pro Phe Met Glu Thr Ser Ala 
130 135 140 

Lys Ser Lys Thr Met Val Asp Glu Leu Phe Ala Glu He Val Arcr Gin 
145 150 155 i 6 o 

Met Asn Tyr Ala Ala Gin Pro Asp Lys Asp Asp Pro Cys Cys Ser Ala 
165 170 " ^ 175 
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Cys Asn Gin 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Arg Glu Tyr Lys Val Val Val Leu Gly 
1 5 10 

Ser Ala Leu Thr Val Gin Phe Val Thr Gly 
20 25 

Asp Pro Thr lie Glu Asp Phe Tyr Arg Lys 
35 40 

Ser Pro Ser Val Leu Glu lie Leu Asp Thr 
50 55 

Ala Ser Met Arg Asp Leu Tyr lie Lys Asn 
65 70 

Val Tyr Ser Leu Val Asn Gin Gin Ser Phe 
85 90 

Arg Asp Gin lie lie Arg Val Lys Arg Tyr 
100 105 

Leu Val Gly Asn Lys Val Asp Leu Glu Gly 
115 120 

Gly Glu Gly Lys Ala Leu Ala Glu Glu Trp 
130 " " 135 

Thr Ser Ala Lys Asn Lys Ala Ser Val Asp 
145 150 

Val Arg Gin Met Asn Tyr Ala Ala Gin Ser 
165 170 



Cys Ser Ala Cys Val lie Leu 
180 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



Ser Gly Gly Val Gly Lys 
15 

Ser Phe lie Glu Lys Tyr 
30 

Glu lie Glu Val Asp Ser 
45 

Ala Gly Thr Glu Gin Phe 

60 

Gly Gin Gly Phe lie Leu 
75 80 

Gin Asp lie Lys Pro Met 
95 

Glu Arg Val Pro Met lie 
110 

Glu Arg Glu Val Ser Tyr 
125 

Ser Cys Pro Phe Met Glu 
140 

Glu Leu Phe Ala Glu He 
155 160 

Asn Gly Asp Glu Gly Cys 
175 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Discopyge ommata 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Arg Glu Tyr Lys Leu Val Val Leu Gly Ser Gly Gly Val Gly Lys 

Ser Ala Leu Thr Val Gin Phe Val Gin Gly He Phe Val Glu Lvs Tvr 
20 25 30 

Asp Pro Thr He Glu Asp Ser Tyr Arg Lys Gin Val Glu Val Asp Cys 
35 40 45 

Gin Pro Cys Met Leu Glu He Leu Asp Thr Ala Gly Thr Glu Gin Phe 
50 55 ~ 60 

Thr Ala Met Arg Asp Leu Tyr Met Lys Asn Gly Gin Gly Phe Ala Leu 
65 70 75 ~ eo 

Val Tyr Ser He Thr Ala Gin Ser Thr Phe Asn Asp Leu Gin Asp Leu 
85 90 ~ 95 

Arg Glu Gin He Leu Arg Val Lys Asp Thr Glu Asp Val Pro Met He 
100 105 no 

Leu Val Gly Asn Lys Cys Asp Leu Glu Asp Glu Arg Val Val Gly Lys 
H5 120 125 

Glu Gin Gly Gin Asn Leu Ala Arg Gin Trp Asn Asn Cys Ala Phe Leu 
130 135 140 

Glu Ser Ser Ala Lys Ser Lys He Asn Val Asn Glu He Phe Tyr Asd 
145 150 155 160 

Leu Val Arg Gin He Asn Arg Lys Ala Pro Val Glu Lys Cys Lys Lvs 
165 170 175 

Lys Lys Ser Gin Cys Thr Leu Leu 
180 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Arg Glu Tyr Lys Leu Val Val Gly Ser Gly Gly Val Gly Lys Ser 
1 5 io * - is 
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Ala Leu Thr Val Gin Phe Val Gin Gly Phe Val Glu Lys Tyr Asp Pro 
20 25 " 30 

Thr lie Glu Asp Ser Tyr Arg Lys Gin Val Glu Val Asp Cys Gin Gin 
35 40 45 

Cys Met Leu Glu Asp Thr Ala Gly Thr Glu Gin Phe Thr Ala Met Arg 
50 55 60 

Asp Leu Tyr Met Lys Asn Gly Gin Gly Phe Ala Leu Val Tyr Ser lie 
65 70 75 80 

Thr Ala Gin Ser Thr Phe Asn Asp Leu Gin Asp Leu Arg Glu Gin lie 
85 90 95 

Leu Arg Val Lys Asp Thr Glu Asp Val Pro Met lie Leu Val Gly Asn 
100 105 110 

Lys Cys Asp Leu Glu Asp Glu Arg Val Val Gly Lys Glu Gin Gly Gin 
115 120 " 125 

Asn Leu Ala Arg Gin Trp Cys Asn Cys Ala Phe Leu Glu Ser Ser Ala 
130 135 140 

Lys Ser Lys lie Asn Val Asn Glu lie Phe Tyr Asp Leu Val Arg Gin 
145 150 155 160 

lie Asn Arg Lys Thr Pro Val Glu Lys Lys Lys Pro Lys Lys Lys Ser 
165 170 ' 175 

Cys Leu Leu Leu 
180 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Arg Glu Tyr Lys Leu Val Val Leu Gly Ser Gly Gly Val Gly Lys 
1 5 10 15 

Ser Ala Leu Thr Val Gin Phe Val Gin Gly He Phe Val Glu Lys Tyr 
20 25 30 

Asp Pro Thr He Glu Asp Ser Tyr Arg Lys Gin Val Glu Val Asp Ala 
35 40 45 

Gin Gin Cys Met Leu Glu He Leu Asp Thr Ala Gly Thr Glu Gin Phe 
50 55 60 

Thr Ala Met Arg Asp Leu Tyr Met Lys Asn Gly Gin Gly Phe Ala Leu 
65 70 75 80 
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Val Tyr Ser lie 



Arg Glu Gin lie 
100 

Leu Val Gly Asn 
115 

Glu Gin Gly Gin 
130 

Glu Ser Ser Ala 
145 

Val Arg Gin He 



Lys Ser Ser 



Thr Ala Gin Ser 
85 



Leu Arg Val Lys 



Lys Cys Asp Leu 
120 

Asn Leu Ala Arg 
13 5 

Lys Ser Lys He 
150 



Asn Arg Lys Thr 
165 



84 



Thr Phe Asn Asp 
90 



Asp Thr Asp Asp 
105 

Glu Asp Glu Arg 



Gin Trp Asn Asn 
140 

Asn Val Glu He 
155 

Pro Val Pro Gly 
170 



Leu Gin Asp Leu 
95 

Val Pro Met He 
110 

Val Val Gly Lys 
125 

Cys Ala Phe Leu 



Phe Tyr Asp Leu 
160 

Lys Ala Arg Lys 
175 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Drosophila melanogaster 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



Met Arg Glu Tyr Lys He Val Val Leu Gly Ser Gly Gly Val Gly Lys 
1 5 10 15 

Ser Ala Leu Thr Val Gin Phe Val Gin Cys He Phe Val Glu Lys Tyr 
2 0 2 5 30 

Asp Pro Thr He Glu Asp Ser Tyr Arg Lys Gin Val Glu Val Asp Gly 
35 40 45 

Gin Gin Cys Met Leu Glu He Leu Asp Thr Ala Gly Thr Glu Gin Phe 
50 55 60 

Thr Ala Met Arg Asp Leu Tyr Met Lys Asn Gly Gin Gly Phe Val Leu 

65 70 75 ~ 80 

Val Tyr Ser He Thr Ala Gin Ser Thr Phe Asn Asp Leu Gin Asp Leu 
85 90 95 

Arg Glu Gin He Leu Arg Val Lys Asp Thr Asp Asp Val Pro Met Val 
100 105 no 

Leu Val Gly Asn Lys Cys Asp Leu Glu Glu Glu Arg Val Val Gly Lys 
115 120 125 

Glu Leu Gly Lys Asn Leu Ala Thr Gin Phe Asn Cys Ala Phe Met Glu 
130 135 140 
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Thr Ser Ala Lys Ala Lys Val Asn Val Asn Asp lie Phe Tyr Asp Leu 
145 150 155 160 

Val Arg Gin lie Asn Lys Lys Ser Pro Glu Lys Lys Gin Lys Lys Pro 
165 170 175 

Lys Lys Ser Leu Cys Val Leu Leu 
180 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostel ium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 44 : 

Met Thr Glu Tyr Lys Leu Val lie Val Gly Gly Gly Gly Val Gly Lys 
15 10 15 

Ser Leu Thr He Gin Leu He Gin Asn His Phe Asp Glu Tyr Asp Pro 
20 25 30 

Thr He Glu Asp Ser Tyr Arg Lys Gin Val Ser He Asp Asp Glu Thr 
35 ~ 40 45 

Cys Leu Leu He Leu Asp Thr Ala Gly Gin Glu Glu Ser Ala Met Arg 
50 55 60 

Asp Gin Tyr Met Arg Thr Gly Gin Gly Phe Leu Cys Val Tyr Ser He 
65 * 70 75 80 

Thr Ser Arg Ser Ser Tyr Asp Glu He Ala Ser Phe Arg Glu Gin He 
85 ' 90 95 

Leu Arg Val Lys Asp Lys Asp Arg Val Pro Leu He Leu Val Gly Asn 
100 105 110 

Lys Ala Asp Leu Asp His Glu Arg Gin Val Ser Val Asn Glu Gly Gin 
115 120 125 

Glu Leu Ala Lys Asp Ser Leu Ser Phe His Glu Ser Ser Ala Lys Ser 
130 ^ ~ 135 140 

Arg He Asn Val Glu Glu Ala Phe Tyr Ser Leu Val Arg Glu He Arg 
145 150 155 160 

Lys Glu Leu Lys Gly Asp Gin Ser Ser Gly Lys Ala Gin Lys Lys Lys 
165 170 175 

Lys Gin Cys Leu He Leu 
180 

(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Ser Val Ser Asn Glu Tyr Lys Leu Val Val Gly Gly Gly Gly Val 
1 5 10 15 

Gly Lys Ser Ala Leu Thr He Gin Phe Gin Asn His Phe He Glu Glu 
2 0 2 5 



30 



Tyr Asp Pro Thr He Glu Asp Ser Tyr Arg Arg Gin Cys Gin Val Asp 
35 40 45 

Glu Asp Thr Cys Leu Leu Asp He Leu Asp Thr Ala Gly Gin Asp Asp 

55 6 0 

Tyr Ser Met Arg Asp Gin Tyr Met Arg Thr Gly Gin Gly Phe Leu Val 
65 70 75 80 

Tyr Asp Val Ser Arg Thr Ser Phe Glu Glu He Asn Val Val Glu Gin 
85 90 95 

He Arg Val Lys Asp Asn Asp Lys Val Pro He Val Leu Val Gly Asn 
100 105 no 

Lys Cys Asp Leu Glu Asn Leu Arg Glu Val Thr Glu Gly Glu Gly Ser 
115 120 125 

Glu Leu Ala Lys Ser Phe Ser Val Pro Phe Leu Glu Thr Ser Ala Lvs 
"0 135 140 y 

Lys Arg Leu Asn Val Asp Glu Cys Phe Phe Glu Val Val Arq Glu He 
145 150 155 leo 

Lys Lys Ser Leu Lys Glu Pro Gly Arg Ser Lys Lys Asp Lys Lys Gly 
165 170 175 

Gly He Leu Lys Lys Phe Lys Gly Gly Asp Cys Leu He Leu 
180 185 ' 190 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Met Ser Lys Leu Leu Lys Leu Val He Val Gly Asp Gly Gly Val Gly 
1 5 10 15 

Lys Ser Ala Leu Thr He Gin Leu Thr Gin Asn Gin Phe He Ala Glu 
20 25 30 

Tyr Asp Pro Thr He Glu Asn Ser Tyr Arg Lys Gin Val Asn He Asp 
35 40 45 

Glu Glu Val Tyr Met Leu Asp He Leu Asp Thr Ala Gly Gin Glu Glu 
50 55 60 

Tyr Ser Ala Met Arg Asp Gin Tyr He Arg Ser Gly Arg Gly Phe Leu 
65 70 75 80 

lie Val Tyr Ser He He Ser Arg Ala Ser Phe Glu Ala Val Thr Thr 
85 90 95 

Phe Arg Glu Gin He Leu Arg Val Lys Asp Leu Ser Thr Tyr Pro He 
100 105 110 

Val He He Gly Asn Lys Ala Asp Leu Pro Asp Lys Asp Arg Lys Val 
115 ' 120 125 

Pro Pro Met Glu Gly Lys Glu Leu Ala Lys Phe Gly Ala Pro Phe Leu 
130 135 140 

Glu Thr Ser Ala Lys Ser Arg Val Asn Val Glu Glu Ala Phe Phe Thr 
145 150 155 160 

Leu Val Arg Glu He Lys Arg Trp Asn Gin Asn Pro Gin Asn Glu Glu 
165 170 175 

Met Leu Pro Pro Lys Lys Arg Gly Cys He He Leu 
180 185 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostel ium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Glu Tyr Lys Leu Val He Val Gly Gly Gly Gly Val Gly Lys Ser 
15 10 15 

Ala Leu Thr He Gin Leu He Gin Asn His Phe He Asp Glu Tyr Asp 
20 25 30 

Pro Thr He Glu Asp Ser Tyr Arg Lys Gin Val Thr He Asp Glu Glu 
35 40 45 
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Thr Cys Leu Leu Asp He Leu Asp Thr Ala Gly Gin Glu Glu Tyr Ser 
50 55 60 

Ala Met Arg Asp Gin Tyr Met Arg Thr Gly Gin Gly Phe Leu Cys Val 
65 70 75 ' 1 ~ 



80 



Tyr Ser He Thr Ser Arg Ser Ser Phe Asp Glu He Ala Ser Phe Ara 

85 90 95 s 

Glu Gin He Leu Arg Val Lys Asp Lys Asp Arg Val Pro Met He Val 

100 105 110 

Val Gly Asn Lys Cys Asp Leu Glu Ser Asp Arg Gin Val Thr Thr Gly 

115 120 ~ 125 y 

Glu Gly Gin Asp Leu Ala Lys Ser Phe Gly Ser Pro Phe Leu Glu Thr 



140 



Ser Ala Lys He Arg Val Asn Val Glu Glu Ala Phe Tyr Ser Leu Val 
145 150 155 160 

Arg Glu He Arg Lys Asp Leu Lys Gly Asp Ser Lys Pro Glu Lys Gly 
I 65 170 !7 5 7 

Lys Lys Lys Arg Pro Leu Lys Ala Cys Thr Leu Leu 
180 i 8 5 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 04 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Caenorhabdit is elegans 



(xi) SEQUENCE DE 

Met Ser Ser Ser 
1 

Ala Val Met Gly 

20 

Phe Thr Gin Asn 
35 

Gin His Ser Lys 
50 

Val Thr Asp Thr 
65 

Cys Ser Leu Asp 



Arg Lys Ser Phe 
100 



1CRIPTION: SEQ I! 

Leu Gin Ser Asn 
5 

Tyr Pro His Val 



He Phe Pro Glu 
40 

His He Ala Ala 
55 

Ala Gly Gin Gin 
70 

He Asn Gly Phe 
8 5 

Glu Met Cys Ser 



> NO : 4 8 : 

Arg Gin Ser Leu 
10 

Gly Lys Ser Ala 
25 



Arg Tyr Glu Ser 



Phe His Arg Asp 
60 

Glu Tyr Thr Val 
75 

He Leu Val Tyr 
90 

Asn He Tyr Glu 
105 



Asn Arg Lys Val 
15 

Leu Val Leu Arg 

3 0 

Thr He Glu Asp 
45 



Tyr His Leu Arg 



Phe Pro Arg Ser 
80 

Ala He Asp Asp 
95 

Lys He Val Arg 
110 
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Thr Tyr Gly Asp Thr Ser lie Pro lie Val lie Val Gly Lys Thr Asp 
115 120 125 

Leu Ser Thr Gin Val Val Arg Ala Glu Glu Gly Glu Glu Leu Ala Arg 
130 135 140 

Gin Trp Asp Ala Lys Phe Val Glu lie Thr Ala Arg Glu Ser Asn Arg 
145 150 155 " 160 

Val His Glu Val Phe Glu Leu Leu Leu Arg Glu lie Glu lie Ser Arg 
165 170 175 

Gly Asn Leu Ser Pro Thr Glu Arg Pro Asn Gly Asn Ser Pro Lys Arg 
180 185 ~ 190 

Pro Phe Lys Asp Asp Gly Lys Pro Cys Ser lie Ser 
195 * 200 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coprinus cinereus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Met Ala Ala Arg Ala Gin Phe Leu Arg Glu Tyr Lys Leu Val Val Val 
15 10 15 

Gly Gly Gly Gly Val Gly Lys Ser Ala Leu Thr lie Gin Phe lie Gin 
20 25 30 

Ser His Phe Val Asp Glu Tyr Asp Pro Thr lie Glu Asp Ser Tyr Arg 
3 5 4 0 45 

Lys Gin Cys lie lie Asp Asp Glu Val Ala Leu Leu Asp Val Leu Asp 
50 55 60 

Thr Ala Gly Gin Glu Glu Tyr Gly Ala Met Arg Glu Gin Tyr Met Arg 
65 70 75 80 

Thr Gly Glu Gly Phe Leu Leu Val Tyr Ser lie Thr Ser Arg Asn Ser 
85 90 95 

Phe Glu Glu lie Ser lie Phe His Gin Gin lie Leu Arg Val Lys Asp 
100 105 ~ no 

Gin Asp Ser Phe Pro Val lie Val Val Ala Asn Lys Cys Asp Leu Glu 
115 120 125 

Tyr Glu Arg Gin Val Gly Met Asn Glu Gly Arg Asp Leu Ala Lys His 
130 135 140 

Phe Gly Cys Lys Phe lie Glu Thr Ser Ala Lys Gin Arg lie Asn Val 
145 150 155 160 
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Asp Glu Ala Phe Ser Asn Leu Val Arg Glu lie Arg Lys Tyr Asn Arg 
165 ~ 170 " 175 

Glu Gin Gin Thr Gly Arg Pro Ala lie Ala Ala Gly Gly Gly Gly Pro 
180 185 ~ 190 

Ala Gly Ser Tyr Thr Gin Asp Arg His His Asp Glu Ala Pro Gly Cys 
195 200 205 

Cys Ala Gly Cys Val lie Ala 
210 215 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 06 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( i i i ) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geodia cydonium 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Thr Glu Tyr Lys lie Val Val Gly Gly Gly Leu Val Gly Lys Ser 
1 5 io 15 

Ala Leu Thr Leu Gin Leu Val Gin Val Cys lie Lys Asp Gin Tyr Tyr 
20 25 " ' 30 

Leu lie Glu Phe Gin Asn Asn Gin Phe Gin Phe Glu Asn Leu Gin Asn 

35 40 45 

His Tyr lie Asp Tyr Asp Pro Thr Val Glu Asp Ser Arg Arg Glu Val 
50 55 60 

Ser lie Asp Asp Gin Thr Cys Leu Asn lie Leu Asp Thr Ala Gly Gin 
65 70 75 ~ 80 

Gin His Ser Asn Ala Gin Ser Met Asp Ala His Trp Ser Thr Val Phe 
85 90 95 

Val Cys Leu Phe Asn Tyr Phe Asn lie Thr Ser Met Tyr Asp Glu lie 
100 105 no 

Ala Ser Phe Arg Glu Gin lie Leu Arg Val Lys Asp Gly Ala Lys Asp 

115 120 125 

Leu Val Pro Leu lie Leu lie He Asn Lys Ala Asp Leu Asp His Glu 
130 135 140 

Ser Gin Gly Ser Gly Asn Glu Gly Gin Leu Ala Lys Asp Ser Leu Ser 
145 150 155 " 160 

Phe His Gin Ser Ser Ala Lys Ser Arg He Asn Leu Glu Glu He Pro 
165 170 175 

Tyr Ser Leu Val Arg Glu Leu Arg Lys Glu Leu Lys Leu Asp Gin Ser 
180 185 190 
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Ser Gly Lys Ala Gin Lys Lys Lys Lys Gin Cys Leu lie He 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
( d ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Canis familiarxs 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

Met Lys Lys Thr Tyr Asp Leu Leu Phe Lys Leu Leu Leu He Gly Asp 
1 5 10 lb 

Ser Gly Val Gly Lys Thr Cys Val Leu Phe Arg Phe Ser Asp Asp Ala 

Phe Asn Thr Thr Phe He Ser He Gly He Asp Phe Lys He Lys Thr 
35 40 " 

Val Glu Leu Gin Gly Lys Lys He Lys Leu Gin lie Trp Asp Thr Ala 
50 55 6° 

Gly Gin Glu Arg Phe His Thr lie Thr Thr Ser Tyr Tyr Arg Gly Ala 
65 70 

Met Gly He Met Leu Val Tyr Asp He Thr Asn Gly Lys Ser Phe Glu 
85 90 

Asn He Ser Lys Trp Leu Arg Asn He Asp Glu His Ala Asn Glu Asp 
100 105 

val Glu Arg Met Leu Leu Gly Asn Lys Cys Asp Met Asp Asp Lys Arg 
115 120 12 = 

Val Val Pro Lys Gly Lys Gly Glu Gin He Ala Arg Glu His Gly He 
130 135 140 

Arg Phe Phe Glu Thr Ser Ala Lys Val Asn He Asn He Glu Lys Ala 
3^45 150 155 

Phe Leu Thr Leu Ala Glu Asp He Leu Arg Lys Thr Pro Val Lys Glu 
165 17C 17 = 

Pro Asn Ser Glu Asn Val Asp He Ser Ser Gly Gly Gly Val Thr Gly 



180 

Trp Lys Ser Lys Cys Cys 
195 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Gly Thr Arg Asp Asp Glu Tyr Asp Tyr Leu Phe Lys Val Val Leu 
15 io X5 

He Gly Asp Ser Gly Val Gly Lys Ser Asn Leu Leu Ser Arg Phe Thr 
20 25 30 

Arg Asn Glu Phe Asn Leu Glu Ser Lys Ser Thr He Gly Val Glu Phe 
35 40 45 

Ala Thr Arg Ser He Gin Val Asp Gly Lys Thr He Lys Ala Gin He 
5 0 5 5 6 0 

Trp Asp Thr Ala Gly Gin Glu Arg Tyr Arg Ala He Thr Ser Ala Tvr 
65 7 0 75 80 

Tyr Arg Gly Ala Val Gly Ala Leu Leu Val Tyr Asp He Ala Lys His 
85 90 95 

Leu Thr Tyr Glu Asn Val Glu Arg Trp Leu Lys Glu Leu Arg Asp His 
100 105 no 

Ala Asp Ser Asn He Val He Met Leu Val Gly Asn Lys Ser Asp Leu 
115 120 125 

Arg His Leu Arg Ala Val Pro Thr Asp Glu Ala Arg Ala Phe Ala Glu 
130 135 140 

Lys Asn Gly Leu Ser Phe He Glu Thr Ser Ala Leu Asp Ser Thr Asn 
145 150 155 * 160 

Val Glu Ala Ala Phe Gin Thr He Leu Thr Glu He Tyr Arg He Val 
165 170 175 

Ser Gin Lys Gin Met Ser Asp Arg Glu Asn Asp Met Ser Pro Ser Asn 
180 185 190 

Asn Val Val Pro He His Val Pro Pro Thr Thr Glu Lys Pro Lys Val 
195 200 205 

Gin Cys Cys Gin Asn He 
210 
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We claim: 



1 l. A composition comprising at least one component selected 

2 from the group consisting of a peptide having an amino acid 

3 sequence selected from the group consisting of Val-Val-Ile, 

4 Ile-Lys-Arg-Val-Lys-Asp (SEQ IDNOrl), Lys-Cys-Asp-Leu-Ala 

5 (SEQ ID NO:2), Cys -Asp -Leu-Ala- Ala- Arg-Thr (SEQ ID NO : 3 ) 

6 and Asp-Leu-Ala-Ala (SEQ ID NO : 4 ) or a physiologically 

7 acceptable salt of said peptide, a cyclized peptide having 

8 a formula selected from the group consisting of cyclo [- 

9 R (1) R (2) Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-Val-Val- Ile- 

10 Asp R(3) R(4)-], cyclo [-R(l) R(2) Val-Val-Ile R(3) R(4)-], 

11 cyclo [-R(l) R(2) Tyr-Arg-Glu-Gln-Ile-Lys-Arg-Val-Lys-Asp- 

12 Ser-Asp-Asp-Val-Pro R(3) R(4)-], cyclo t-R(l) R(2> Lys-Arg- 

13 Val R(3) R(4)-], cyclo t-R(l) R(2) Ile-Lys-Arg-Val-Lys-Asp 

14 R{3) R(4)-], cyclo [-R(l) R(2) Gly-Asn-Lys-Cys -Asp-Leu-A; a- 

15 Ala-Arg-Thr-Val-Glu R(3) R(4)-], cyclo [-R(l) R(2) Lys-Cys- 

16 Asp-Leu-Ala R (3) R(4)-], cyclo [-R(l) R(2) Cys-Asp-Leu-Ala- 

17 Ala- Arg-Thr R (3) R(4)-], cyclo [-R(l) R{2) Asp-Leu-A; a- Ala 

18 R{3) R(4)-], cyclo [- R(l) R{2) D-Thr- Ile-Glu-Asp-Ser-Tyr- 

19 Arg-Lys-Gln-Val-D-Val-Ile-Asp R(3) R(4)-], cyclo [-R(l) 

20 R(2) D-Val-D-Val-D-Ile R(3) R(4)-], cyclo [-R(l) R(2) D- 

21 Tyr-Arg-Glu-Gln-Ile-Lys-Arg-Val -Lys-Asp-D-Ser-Asp-D-Asp-Val 

22 Pro R(3) R(4)-], cyclo [-R(l) R(2) D-Lys-D-Arg-D-Val -R ( 3 ) 

23 R(4)-], cyclo [-R(l) R(2) D-Ile-Lys-Arg-Val-Lys-D-Asp-R ( 3 ) 

24 R(4)-], cyclo [-R(l) R(2) Gly-D-Asn-Lys-Cys-Asp-Leu-D-Ala- 

25 Ala-Arg-Thr-D-Val-Glu R(3) R(4)-] r cyclo [-R(l) R(2) D-Lys- 

26 Cys-Asp-Leu-D-Ala R(3) R(4)-] / cyclo [-R(l) R(2) Cys-Asp- 

27 Leu-Ala-Ala-Arg-D-Thr R(3) R(4)-], cyclo [-R(l) R(2) Asp-D- 

28 Leu-D-Ala-D-Ala R(3) R(4)-] / and 

2 9 NH 2 

30 I 

31 O Asp 

32 1 , I 

33 H 2 N-Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-NH-CH 2 -C-Val-NH-CH 

34 * I i 

3 5 GH 2 CH 2 
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36 
37 
38 
39 
40 



wherein R(l) R( 2 ), R(3) and R(4) represent independently 
alanine, ornithine, cysteine, lysine, glutamic and aspartic 
acid, and wherein there is a covalent bond between the 
carboxyl and amino termini by which R(i) and R(4) are 
interconnected to each other via a methylene bridge of type 
41 -~(CH 2 ) m --or --(CH 2 ) m --M--(CH 2 ) ra .--, wherein m and m' are 

integers from 1, 2, 3, or 4, and M is NH, N[R(5)], o, or S, 
and wherein R(5) is methyl, ethyl, n-propyl, isopropyl,' 
cyclopropyl, or cyclobutyl, or the sidechain of any 
naturally occurring amino acid, and a physiologically 
acceptable salt thereof and a pept idomimetic falling within 
47 the structure 



42 
43 
44 
45 
46 



y 



* k ,u 



y^\ 



x io^ y 



i 



C—O R 

/ 

OOC- ( CH 2 ) m 

48 STRUCTURE 1 



y x 



49 wherein the sidechain R attached at the carbon atom 

50 numbered 6 on the sterol nucleus can be NH-CH 2 - CH 2 NH 3 < 



51 alkylamino, arylamino, or aralkylamino group, and wherein 

52 the sidechain attached at the carbon number 3 can be 

53 replaced with -O-C (=0) - - (CH 2 ) m - -COOH, where m is an integer 

54 from 1 to 6, inclusive, preferably from 1 to 3 , inclusive, 

55 and more preferably 2, and one of X and Y at each position 

56 independently, can be one H, a small alkyl group of C, to 

57 C 3' preferably C 1# - a halogen, preferably F, or an amino 

58 group where the other of one of X and Y is H. Preferably, 
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59 each of X and Y is H and a pharmaceut ically acceptable salt 

6 0 thereof. 

1 2. The composition of claim 1 wherein said component is a 

2 peptide has a structure having the structure 
3 

4 NH 2 

5 I 

6 0 Asp 

7 t 

8 H 0 N-Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-NH-CH 2 -C-Val-NH-CH 

9 I I 

10 CH 2 CH 2 

1 3 . The composition of claim 1 wherein said peptidomimetic 

2 capable of inhibiting the oncogenic activity of p21 Ras has 

3 the structure 



4 
5 
6 

7 x 1 x 

8 



y 

k ./- y 

V 



9 x ^ri ^ 13 

10 y I 1 I ^«-x 

11 x ^1 i l « /\ 

^ 15 y 



12 N 2-^ "10- 

13 y 

14 



i i 



y x 



is cr 

16 / | 

17 C=ZO R 

18 . / 

19 OOC- ( CH 2 ) m 

20 
21 

22 STRUCTURE 1 

23 

24 

25 wherein the sidechain R attached at the carbon atom 

2 6 numbered 6 on the sterol nucleus can be NH - CH 2 - CH 2 NH 3 * , 

27 alkylamino, arylamino, or aralkylamino group, and wherein 

28 the sidechain attached at the carbon number 3 can be 

29 replaced with -0-C (=0) - - (CH 2 ) m - -COOH, where m is an integer 

30 from 1 to 6, inclusive, preferably from 1 to 3 , inclusive, 

31 and more preferably 2, and one of X and Y at each position 

32 independently, can be one H, a small alkyl group of C x to 

3 3 C 3/ preferably C x ; a halogen, preferably F, or an amino 



WO 97/10836 



PCT/US96/15098 



96 

group where the other of one of X and Y is H. Preferably, 
each of X and Y is H and a pharmaceut ically acceptable salt 
thereof . 

4. The composition of claim i wherein said peptidomimetic has 
the structure 



1 2 

^ \ 17 
11^ N 13^ 



1 

I 1 I 



\ 



16 



/ | 

CZZO NH 
/ I 
OOC- CH ? CH, 

I 

CH 2 
* N H 0 



STRUCTURE 2 

or a pharmaceutical^ acceptable salt thereof. 

A method for inhibiting the oncogenic or transforming 
activity of p21 Ras, said method comprising the step of 
administering an amount of the composition of of claim l 
effective to achieve the result that the transforming 
and/or oncogenic activity of the p21 Ras protein is 
inhibited . 

A method for inhibiting the oncogenic or transforming 
activity of P 21 Ras, said method comprising the step of 
administering an amount of a composition comprising at 
least one cyclized peptide as set forth in claim l, 
effective to achieve the result that the transforming 
and/or oncogenic activity of the p21 Ras protein is 
inhibited. 
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1 7. The method of claim 6 wherein the cyclized peptide has a 

2 structure 

4 NH 2 

5 I 

6 0 ^ S P 

7 I 

8 H 2 N-Thr-Ile-Glu-Asp-Ser-Tyr-Arg-Lys-Gln-NH-CH 2 -C-Val-NH-CH 

9 I I 

10 CH 2 CH 2 

1 8. A method for inhibiting the oncogenic or transforming 

2 activity of p21 Ras, said method comprising the step of 

3 administering an effective amount of at least one 

4 peptidomimetic as set forth in claim 4, with the result 

5 that the transforming and/or oncogenic activity of the p21 

6 Ras protein is inhibited. 

1 9. The method of claim 8 wherein said peptidomimetic is 

2 3-malonoxy-6-N- (2aminoethyl) 

3 aminocyclopentanoperhydrophenanthrene . 
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